Jacob Steinhardt

Cited by

	All	Since 2019
Citations	15860	14996
h-index	45	43
i10-index	75	69

Citations per year

Year	2016	2017	2018	2019	2020	2021	2022	2023	2024
Citations	66	174	484	996	1576	2140	2767	4888	2576

Public access

24 articles available
0 articles not available

Based on funding mandates

Co-authors

Dan Hendrycks	Director of the Center for AI Safety	Verified email at berkeley.edu
Dawn Song	Professor of Computer Science, UC Berkeley	Verified email at cs.berkeley.edu
Steven Basart	PhD, University of Chicago	Verified email at ttic.edu
Percy Liang	Associate Professor of Computer Science, Stanford University	Verified email at cs.stanford.edu
Christopher Olah	Anthropic	Verified email at google.com
John Schulman	Research Scientist, OpenAI	Verified email at openai.com
Aditi Raghunathan	Assistant professor, Carnegie Mellon University	Verified email at cmu.edu
Dario Amodei	CEO and Co-Founder at Anthropic	Verified email at anthropic.com
Paul Christiano	National Institute of Standards and Technology	Verified email at nist.gov
Gregory Valiant	Assistant Professor of Computer Science, Stanford University	Verified email at stanford.edu
Pang Wei Koh	University of Washington	Verified email at cs.washington.edu
Zachary C. Lipton	Raj Reddy Associate Professor of Machine Learning @ Carnegie Mellon University; CTO + CSO @ Abridge	Verified email at cmu.edu
Moses Charikar	Professor of Computer Science, Stanford University	Verified email at cs.stanford.edu
Jerry Li	Microsoft Research	Verified email at microsoft.com
Daniel Kang	UIUC	Verified email at illinois.edu
Tom B Brown	Anthropic	Verified email at anthropic.com
Andrew Ilyas	Massachusetts Institute of Technology	Verified email at mit.edu
Pravesh K. Kothari	Carnegie Mellon University	Verified email at cs.cmu.edu
Yi Sun	Assistant Professor, UChicago Statistics	Verified email at statistics.uchicago.edu
Banghua Zhu	University of California, Berkeley	Verified email at berkeley.edu

Jacob Steinhardt

Stanford University

Verified email at cs.stanford.edu

Machine learning, Statistics


Title	Authors	Venue	Cited by	Year
Concrete problems in AI safety	D Amodei, C Olah, J Steinhardt, P Christiano, J Schulman, D Mané	arXiv preprint arXiv:1606.06565, 2016	2549	2016
The many faces of robustness: A critical analysis of out-of-distribution generalization	D Hendrycks, S Basart, N Mu, S Kadavath, F Wang, E Dorundo, R Desai, ...	Proceedings of the IEEE/CVF international conference on computer vision …, 2021	1248	2021
Natural adversarial examples	D Hendrycks, K Zhao, S Basart, J Steinhardt, D Song	Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	1176	2021
Measuring massive multitask language understanding	D Hendrycks, C Burns, S Basart, A Zou, M Mazeika, D Song, J Steinhardt	arXiv preprint arXiv:2009.03300, 2020	1125	2020
Certified defenses against adversarial examples	A Raghunathan, J Steinhardt, P Liang	arXiv preprint arXiv:1801.09344, 2018	1046	2018
The malicious use of artificial intelligence: Forecasting, prevention, and mitigation	M Brundage, S Avin, J Clark, H Toner, P Eckersley, B Garfinkel, A Dafoe, ...	arXiv preprint arXiv:1802.07228, 2018	950	2018
Certified defenses for data poisoning attacks	J Steinhardt, PWW Koh, PS Liang	Advances in neural information processing systems 30, 2017	813	2017
Measuring mathematical problem solving with the MATH dataset	D Hendrycks, C Burns, S Kadavath, A Arora, S Basart, E Tang, D Song, ...	arXiv preprint arXiv:2103.03874, 2021	489	2021
Semidefinite relaxations for certifying robustness to adversarial examples	A Raghunathan, J Steinhardt, PS Liang	Advances in neural information processing systems 31, 2018	464	2018
Troubling Trends in Machine Learning Scholarship: Some ML papers suffer from flaws that could mislead the public and stymie future research.	ZC Lipton, J Steinhardt	Queue 17 (1), 45-77, 2019	346	2019
Learning from untrusted data	M Charikar, J Steinhardt, G Valiant	Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing …, 2017	315	2017
Scaling out-of-distribution detection for real-world settings	D Hendrycks, S Basart, M Mazeika, A Zou, J Kwon, M Mostajabi, ...	arXiv preprint arXiv:1911.11132, 2019	307	2019
SONYC: A system for monitoring, analyzing, and mitigating urban noise pollution	JP Bello, C Silva, O Nov, RL Dubois, A Arora, J Salamon, C Mydlarz, ...	Communications of the ACM 62 (2), 68-77, 2019	306	2019
Sever: A robust meta-algorithm for stochastic optimization	I Diakonikolas, G Kamath, D Kane, J Li, J Steinhardt, A Stewart	International Conference on Machine Learning, 1596-1606, 2019	305	2019
Measuring coding challenge competence with APPS	D Hendrycks, S Basart, S Kadavath, M Mazeika, A Arora, E Guo, C Burns, ...	arXiv preprint arXiv:2105.09938, 2021	291	2021
Aligning AI with shared human values	D Hendrycks, C Burns, S Basart, A Critch, J Li, D Song, J Steinhardt	arXiv preprint arXiv:2008.02275, 2020	258	2020
Jailbroken: How does LLM safety training fail?	A Wei, N Haghtalab, J Steinhardt	Advances in Neural Information Processing Systems 36, 2024	243	2024
Stronger data poisoning attacks break data sanitization defenses	PW Koh, J Steinhardt, P Liang	Machine Learning, 1-47, 2022	237	2022
Unsolved problems in ML safety	D Hendrycks, N Carlini, J Schulman, J Steinhardt	arXiv preprint arXiv:2109.13916, 2021	233	2021
Rethinking bias-variance trade-off for generalization of neural networks	Z Yang, Y Yu, C You, J Steinhardt, Y Ma	International Conference on Machine Learning, 10767-10777, 2020	187	2020


Articles 1–20
