‪Tiancheng Jin‬ - ‪Google 학술 검색‬

내 프로필 만들기

인용

	전체	2019년 이후
서지정보	410	410
h-index	8	8
i10-index	8	8

0

140

70

35

105

2020202120222023202432 96 100 122 60

공개 액세스

자료 5개

자료 0개

공개

비공개

재정 지원 요구사항 기준

Tiancheng Jin

Tiancheng Jin

Ph.D. student, University of Southern California

usc.edu의 이메일 확인됨

Machine Learning Theory Online Learning Theory RL Theory


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Learning adversarial markov decision processes with bandit feedback and unknown transition C Jin, T Jin, H Luo, S Sra, T Yu International Conference on Machine Learning, 4860-4869, 2020	122*	2020
Deep reinforcement learning for multi-driver vehicle dispatching and repositioning problem J Holler, R Vuorio, Z Qin, X Tang, Y Jiao, T Jin, S Singh, C Wang, J Ye 2019 IEEE International Conference on Data Mining (ICDM), 1090-1095, 2019	114	2019
Simultaneously learning stochastic and adversarial episodic mdps with known transition T Jin, H Luo Advances in neural information processing systems 33, 16557-16566, 2020	58	2020
The best of both worlds: stochastic and adversarial episodic mdps with unknown transition T Jin, L Huang, H Luo Advances in Neural Information Processing Systems 34, 20491-20502, 2021	39	2021
Boosting dynamic programming with neural networks for solving np-hard problems F Yang, T Jin, TY Liu, X Sun, J Zhang Asian Conference on Machine Learning, 726-739, 2018	23	2018
Suvrit Sra, and Tiancheng Yu. Learning adversarial mdps with bandit feedback and unknown transition C Jin, T Jin, H Luo arXiv preprint arXiv:1912.01192, 2019	19	2019
Near-optimal regret for adversarial mdp with delayed bandit feedback T Jin, T Lancewicki, H Luo, Y Mansour, A Rosenberg Advances in Neural Information Processing Systems 35, 33469-33481, 2022	17	2022
Improved best-of-both-worlds guarantees for multi-armed bandits: Ftrl with general regularizers and multiple optimal arms T Jin, J Liu, H Luo Advances in Neural Information Processing Systems 36, 2024	11	2024
Suvrit Sra, and Tiancheng Yu C Jin, T Jin, H Luo Learning adversarial mdps with bandit feedback and unknown transition, 2019	5	2019
No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions T Jin, J Liu, C Rouyer, W Chang, CY Wei, H Luo Advances in Neural Information Processing Systems 36, 2024	2	2024
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback A Rosenberg, H Luo, T Jin, Y Mansour		2022

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–11