Yan Duan

인용

	전체	2019년 이후
서지정보	16790	14230
h-index	21	20
i10-index	23	22

3000

1500

750

2250

201520162017201820192020202120222023202452 174 684 1569 2180 2614 2997 2874 2774 788

공개 액세스

모두 보기

자료 8개

자료 0개

공개

비공개

재정 지원 요구사항 기준

공동 저자

Pieter AbbeelUC Berkeley | Covariantcs.berkeley.edu의 이메일 확인됨
(Peter) Xi Chencovariant.ai | UC Berkeleyberkeley.edu의 이메일 확인됨
John SchulmanResearch Scientist, OpenAIopenai.com의 이메일 확인됨
Rein HouthooftNetflix Researchnetflix.com의 이메일 확인됨
Ilya SutskeverCo-Founder and Chief Scientist of OpenAIopenai.com의 이메일 확인됨
Jonathan Hoberkeley.edu의 이메일 확인됨
Haoran TangPhD student in Applied Mathematics; University of California, Berkeleymath.berkeley.edu의 이메일 확인됨
Ken GoldbergProfessor, UC Berkeley and UCSFberkeley.edu의 이메일 확인됨
Sachin PatilNvidianvidia.com의 이메일 확인됨
Ian GoodfellowDeepMinddeepmind.com의 이메일 확인됨
Nicolas PapernotUniversity of Toronto and Vector Instituteutoronto.ca의 이메일 확인됨
Alex X. LeeResearch Scientist, Google DeepMindgoogle.com의 이메일 확인됨
Carlos FlorensaPhD from University of California at Berkeleyberkeley.edu의 이메일 확인됨
Sergey LevineUC Berkeley, Physical Intelligenceeecs.berkeley.edu의 이메일 확인됨
Trevor DarrellProfessor of Computer Science, U.C. Berkeleyeecs.berkeley.edu의 이메일 확인됨
Peter BartlettProfessor, EECS and Statistics, UC Berkeleycs.berkeley.edu의 이메일 확인됨
Jia PanComputer Science, The University of Hong Kongcs.hku.hk의 이메일 확인됨
Ibrahim AwwalPhD Student in Electrical and Computer Engineering, UC San Diegoeng.ucsd.edu의 이메일 확인됨
Diederik P. KingmaResearch Scientist, Google Braingoogle.com의 이메일 확인됨
Prafulla DhariwalResearcher, OpenAIopenai.com의 이메일 확인됨

팔로우

Yan Duan

Covariant.AI

covariant.ai의 이메일 확인됨 - 홈페이지

Robotics Machine Learning Reinforcement Learning Meta Learning


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets X Chen, Y Duan, R Houthooft, J Schulman, I Sutskever, P Abbeel Advances in Neural Information Processing Systems, 2172-2180, 2016	5180	2016
Benchmarking deep reinforcement learning for continuous control Y Duan, X Chen, R Houthooft, J Schulman, P Abbeel International conference on machine learning, 1329-1338, 2016	1970	2016
RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning Y Duan, J Schulman, X Chen, PL Bartlett, I Sutskever, P Abbeel arXiv preprint arXiv:1611.02779, 2016	1072	2016
Adversarial attacks on neural network policies S Huang, N Papernot, I Goodfellow, Y Duan, P Abbeel arXiv preprint arXiv:1702.02284, 2017	928	2017
Vime: Variational information maximizing exploration R Houthooft, X Chen, Y Duan, J Schulman, F De Turck, P Abbeel Advances in neural information processing systems 29, 2016	912	2016
Motion planning with sequential convex optimization and convex collision checking J Schulman, Y Duan, J Ho, A Lee, I Awwal, H Bradlow, J Pan, S Patil, ... The International Journal of Robotics Research 33 (9), 1251-1270, 2014	823	2014
Variational lossy autoencoder X Chen, DP Kingma, T Salimans, Y Duan, P Dhariwal, J Schulman, ... arXiv preprint arXiv:1611.02731, 2016	764	2016
Evaluating protein transfer learning with TAPE R Rao, N Bhattacharya, N Thomas, Y Duan, P Chen, J Canny, P Abbeel, ... Advances in neural information processing systems 32, 2019	751	2019
One-shot imitation learning Y Duan, M Andrychowicz, B Stadie, OAI Jonathan Ho, J Schneider, ... Advances in neural information processing systems 30, 2017	749	2017
Deep Spatial Autoencoders for Visuomotor Learning C Finn, XY Tan, Y Duan, T Darrell, S Levine, P Abbeel International Conference on Robotics and Automation (ICRA), 2016	699*	2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning H Tang, R Houthooft, D Foote, A Stooke, X Chen, Y Duan, J Schulman, ... arXiv preprint arXiv:1611.04717, 2016	664	2016
Model-ensemble trust-region policy optimization T Kurutach, I Clavera, Y Duan, A Tamar, P Abbeel arXiv preprint arXiv:1802.10592, 2018	501	2018
Flow++: Improving flow-based generative models with variational dequantization and architecture design J Ho, X Chen, A Srinivas, Y Duan, P Abbeel International conference on machine learning, 2722-2730, 2019	454	2019
Stochastic neural networks for hierarchical reinforcement learning C Florensa, Y Duan, P Abbeel arXiv preprint arXiv:1704.03012, 2017	406	2017
Deep unsupervised cardinality estimation Z Yang, E Liang, A Kamsetty, C Wu, Y Duan, X Chen, P Abbeel, ... arXiv preprint arXiv:1905.04278, 2019	211	2019
Variance reduction for policy gradient with action-dependent factorized baselines C Wu, A Rajeswaran, Y Duan, V Kumar, AM Bayen, S Kakade, I Mordatch, ... arXiv preprint arXiv:1803.07246, 2018	168	2018
The Importance of Sampling in Meta-Reinforcement Learning B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ... Advances in Neural Information Processing Systems, 9299-9309, 2018	160*	2018
NeuroCard: one cardinality estimator for all tables Z Yang, A Kamsetty, S Luan, E Liang, Y Duan, X Chen, I Stoica arXiv preprint arXiv:2006.08109, 2020	145	2020
Attacking machine learning with adversarial examples I Goodfellow, N Papernot, S Huang, Y Duan, P Abbeel, J Clark OpenAI Blog 24, 1, 2017	76	2017
Sigma hulls for gaussian belief space planning for imprecise articulated robots amid obstacles A Lee, Y Duan, S Patil, J Schulman, Z McCarthy, J Van Den Berg, ... 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2013	45	2013

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–20

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용

공동 저자