Łukasz Kaiser
Google Brain & CNRS
Verified email at liafa.jussieu.fr
Title
Cited by
Year
Attention is all you need
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint arXiv:1706.03762, 2017
Cited by 22392, 2017
TensorFlow: A system for large-scale machine learning
M Abadi, P Barham, J Chen, Z Chen, A Davis, J Dean, M Devin, ...
12th USENIX Symposium on Operating Systems Design and Implementation …, 2016
Cited by 19931*, 2016
TensorFlow: Large-scale machine learning on heterogeneous systems
M Abadi, A Agarwal, P Barham, E Brevdo, Z Chen, C Citro, GS Corrado, ...
Cited by 9197, 2015
Google's neural machine translation system: Bridging the gap between human and machine translation
Y Wu, M Schuster, Z Chen, QV Le, M Norouzi, W Macherey, M Krikun, ...
arXiv preprint arXiv:1609.08144, 2016
Cited by 4264, 2016
Grammar as a foreign language
O Vinyals, Ł Kaiser, T Koo, S Petrov, I Sutskever, G Hinton
Advances in neural information processing systems 28, 2773-2781, 2015
Cited by 896, 2015
Multi-task sequence to sequence learning
MT Luong, QV Le, I Sutskever, O Vinyals, L Kaiser
arXiv preprint arXiv:1511.06114, 2015
Cited by 650, 2015
Regularizing neural networks by penalizing confident output distributions
G Pereyra, G Tucker, J Chorowski, Ł Kaiser, G Hinton
arXiv preprint arXiv:1701.06548, 2017
Cited by 552, 2017
Advances in neural information processing systems
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
Neural Information Processing Systems Foundation, 5998-6008, 2017
Cited by 424, 2017
Image transformer
N Parmar, A Vaswani, J Uszkoreit, L Kaiser, N Shazeer, A Ku, D Tran
International Conference on Machine Learning, 4055-4064, 2018
Cited by 403, 2018
Tensor2tensor for neural machine translation
A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ...
arXiv preprint arXiv:1803.07416, 2018
Cited by 374, 2018
Generating Wikipedia by summarizing long sequences
PJ Liu, M Saleh, E Pot, B Goodrich, R Sepassi, L Kaiser, N Shazeer
arXiv preprint arXiv:1801.10198, 2018
Cited by 364, 2018
Adding gradient noise improves learning for very deep networks
A Neelakantan, L Vilnis, QV Le, I Sutskever, L Kaiser, K Kurach, J Martens
arXiv preprint arXiv:1511.06807, 2015
Cited by 346, 2015
TensorFlow: Large-scale machine learning on heterogeneous systems, software available from tensorflow.org (2015)
M Abadi, A Agarwal, P Barham, E Brevdo, Z Chen, C Citro, GS Corrado, ...
URL https://www.tensorflow.org, 2015
Cited by 334, 2015
Universal transformers
M Dehghani, S Gouws, O Vinyals, J Uszkoreit, Ł Kaiser
arXiv preprint arXiv:1807.03819, 2018
Cited by 310, 2018
Reformer: The efficient transformer
N Kitaev, Ł Kaiser, A Levskaya
arXiv preprint arXiv:2001.04451, 2020
Cited by 308, 2020
Neural GPUs learn algorithms
Ł Kaiser, I Sutskever
arXiv preprint arXiv:1511.08228, 2015
Cited by 265, 2015
Sentence compression by deletion with LSTMs
K Filippova, E Alfonseca, CA Colmenares, Ł Kaiser, O Vinyals
Proceedings of the 2015 Conference on Empirical Methods in Natural Language …, 2015
Cited by 263, 2015
One model to learn them all
L Kaiser, AN Gomez, N Shazeer, A Vaswani, N Parmar, L Jones, ...
arXiv preprint arXiv:1706.05137, 2017
Cited by 250, 2017
Model-based reinforcement learning for Atari
L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ...
arXiv preprint arXiv:1903.00374, 2019
Cited by 249, 2019
Learning to remember rare events
Ł Kaiser, O Nachum, A Roy, S Bengio
arXiv preprint arXiv:1703.03129, 2017
Cited by 238, 2017