Leslie Kaelbling
Unknown affiliation
Verified email at csail.mit.edu
Title
Cited by
Year
Reinforcement learning: A survey
LP Kaelbling, ML Littman, AW Moore
Journal of artificial intelligence research 4, 237-285, 1996
Cited by: 10214, Year: 1996
Planning and acting in partially observable stochastic domains
LP Kaelbling, ML Littman, AR Cassandra
Artificial intelligence 101 (1-2), 99-134, 1998
Cited by: 5070, Year: 1998
Acting optimally in partially observable stochastic domains
AR Cassandra, LP Kaelbling, ML Littman
AAAI 94, 1023-1028, 1994
Cited by: 950, Year: 1994
Learning policies for partially observable environments: Scaling up
ML Littman, AR Cassandra, LP Kaelbling
Machine Learning Proceedings 1995, 362-370, 1995
Cited by: 940, Year: 1995
Learning in embedded systems
LP Kaelbling
MIT Press, 1993
Cited by: 924, Year: 1993
Acting under uncertainty: Discrete Bayesian models for mobile-robot navigation
AR Cassandra, LP Kaelbling, JA Kurien
Proceedings of IEEE/RSJ International Conference on Intelligent Robots and …, 1996
Cited by: 753, Year: 1996
On the complexity of solving Markov decision problems
ML Littman, TL Dean, LP Kaelbling
arXiv preprint arXiv:1302.4971, 2013
Cited by: 714, Year: 2013
Hierarchical task and motion planning in the now
LP Kaelbling, T Lozano-Pérez
2011 IEEE International Conference on Robotics and Automation, 1470-1477, 2011
Cited by: 692, Year: 2011
To transfer or not to transfer
MT Rosenstein, Z Marx, LP Kaelbling, TG Dietterich
NIPS 2005 workshop on transfer learning 898 (3), 2005
Cited by: 623, Year: 2005
Effective reinforcement learning for mobile robots
WD Smart, LP Kaelbling
Proceedings 2002 IEEE International Conference on Robotics and Automation …, 2002
Cited by: 552, Year: 2002
An architecture for intelligent reactive systems
LP Kaelbling
Reasoning about actions and plans, 395-410, 1987
Cited by: 504, Year: 1987
The synthesis of digital machines with provable epistemic properties
SJ Rosenschein, LP Kaelbling
Theoretical aspects of reasoning about knowledge, 83-98, 1986
Cited by: 490, Year: 1986
Generalization in deep learning
K Kawaguchi, LP Kaelbling, Y Bengio
arXiv preprint arXiv:1710.05468, 2017
Cited by: 426, Year: 2017
Integrated task and motion planning in belief space
LP Kaelbling, T Lozano-Pérez
The International Journal of Robotics Research 32 (9-10), 1194-1227, 2013
Cited by: 415, Year: 2013
Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons.
D Chapman, LP Kaelbling
IJCAI 91, 726-731, 1991
Cited by: 399, Year: 1991
Hierarchical solution of Markov decision processes using macro-actions
M Hauskrecht, N Meuleau, LP Kaelbling, TL Dean, C Boutilier
arXiv preprint arXiv:1301.7381, 2013
Cited by: 398, Year: 2013
Learning to cooperate via policy search
L Peshkin, KE Kim, N Meuleau, LP Kaelbling
arXiv preprint cs/0105032, 2001
Cited by: 390, Year: 2001
Belief space planning assuming maximum likelihood observations
R Platt Jr, R Tedrake, L Kaelbling, T Lozano-Perez
Cited by: 380, Year: 2010
Action and planning in embedded agents
LP Kaelbling, SJ Rosenschein
Robotics and autonomous systems 6 (1-2), 35-48, 1990
Cited by: 365, Year: 1990
Learning to achieve goals
LP Kaelbling
IJCAI 2, 1094-8, 1993
Cited by: 351, Year: 1993