Show, Attend and Tell: Neural Image Caption Generation with Visual Attention K Xu, J Ba, R Kiros, K Cho, A Courville, R Salakhutdinov, R Zemel, ... International Conference on Machine Learning (ICML) 2 (3), 5, 2015 | 12130 | 2015 |
Theano: A Python framework for fast computation of mathematical expressions R Al-Rfou, G Alain, A Almahairi, C Angermueller, D Bahdanau, N Ballas, ... arXiv e-prints, arXiv: 1605.02688, 2016 | 910 | 2016 |
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 791 | 2023 |
Probabilistic model-agnostic meta-learning C Finn*, K Xu*, S Levine Advances in neural information processing systems 31, 2018 | 780 | 2018 |
An actor-critic algorithm for sequence prediction D Bahdanau, P Brakel, K Xu, A Goyal, R Lowe, J Pineau, A Courville, ... International Conference on Learning Representations (2017), 2016 | 683 | 2016 |
Meta-dataset: A dataset of datasets for learning to learn from few examples E Triantafillou, T Zhu, V Dumoulin, P Lamblin, U Evci, K Xu, R Goroshin, ... International Conference on Learning Representations (2020), 2019 | 643 | 2019 |
On using monolingual corpora in neural machine translation C Gulcehre, O Firat, K Xu, K Cho, L Barrault, HC Lin, F Bougares, ... arXiv preprint arXiv:1503.03535, 2015 | 632 | 2015 |
Bridging the gap between value and policy based reinforcement learning O Nachum, M Norouzi, K Xu, D Schuurmans Advances in neural information processing systems 30, 2017 | 493 | 2017 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 488 | 2023 |
Theano: A Python framework for fast computation of mathematical expressions TTD Team, R Al-Rfou, G Alain, A Almahairi, C Angermueller, D Bahdanau, ... arXiv preprint arXiv:1605.02688, 2016 | 207 | 2016 |
Unsupervised perceptual rewards for imitation learning P Sermanet*, K Xu*, S Levine Robotics: Science and Systems (RSS 2017), 2016 | 176 | 2016 |
Trust-pcl: An off-policy trust region method for continuous control O Nachum, M Norouzi, K Xu, D Schuurmans International Conference on Learning Representations (2018), 2018 | 138 | 2018 |
On integrating a language model into neural machine translation C Gulcehre, O Firat, K Xu, K Cho, Y Bengio Computer Speech & Language 45, 137-148, 2017 | 125 | 2017 |
Learning a prior over intent via meta-inverse reinforcement learning K Xu, E Ratner, A Dragan, S Levine, C Finn International conference on machine learning, 6952-6962, 2019 | 81 | 2019 |
Reset-free reinforcement learning via multi-task learning: Learning dexterous manipulation behaviors without human intervention A Gupta, J Yu, TZ Zhao, V Kumar, A Rovinsky, K Xu, T Devlin, S Levine 2021 IEEE International Conference on Robotics and Automation (ICRA), 6664-6671, 2021 | 77 | 2021 |
Continual Learning of Control Primitives: Skill Discovery via Reset-Games K Xu*, S Verma*, C Finn, S Levine Advances in neural information processing systems, 2020, 2020 | 37 | 2020 |
Autonomous Reinforcement Learning: Formalism and Benchmarking A Sharma*, K Xu*, N Sardana, A Gupta, K Hausman, S Levine, C Finn International Conference on Learning Representations (2022), 2021 | 22 | 2021 |
Beyond human data: Scaling self-training for problem-solving with language models A Singh, JD Co-Reyes, R Agarwal, A Anand, P Patil, PJ Liu, J Harrison, ... arXiv preprint arXiv:2312.06585, 2023 | 17 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 11 | 2024 |
Small-scale proxies for large-scale transformer training instabilities M Wortsman, PJ Liu, L Xiao, K Everett, A Alemi, B Adlam, JD Co-Reyes, ... arXiv preprint arXiv:2309.14322, 2023 | 9 | 2023 |