基本信息
浏览量:713
职业迁徙
个人简介
I have always been working at the boundary of new machine learning methods and their application to novel challenges: neural forecasting systems for financial trading and sales rate prediction (‘George’, 1994; ‘Bild-Zeitung’, 1998 - 2008), self-learning agents that control self-driving cars (at Stanford, 2006) or reading thoughts and even controlling brain activity ('BrainLinks BrainTools', 2011 - 2019). Brainstormers, our robotic soccer team, was a 5 times winner of the RoboCup World Championship and one of the first teams to use reinforcement learning (RL) as their core method (1998 - 2008). The data-efficient reinforcement learning algorithms Neural Fitted Q Iteration (NFQ, 2005; NFQCA, 2011) and Deep Fitted Q (DFQ, 2010) laid the ground for many methods in current Artificial Intelligence (AI) research.
研究兴趣
论文共 225 篇作者统计合作学者相似作者
按年份排序按引用量排序主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
Dhruva Tirumala,Markus Wulfmeier,Ben Moran,Sandy Huang,Jan Humplik,Guy Lever,Tuomas Haarnoja,Leonard Hasenclever,Arunkumar Byravan, Nathan Batchelor, Neil Sreendra, Kushal Patel,
arxiv(2024)
引用0浏览0引用
0
0
CoRR (2023): 133-150
引用0浏览0EI引用
0
0
ICLR 2024 (2023)
引用0浏览0EI引用
0
0
Daniel J. Mankowitz,Andrea Michi,Anton Zhernov,Marco Gelmi,Marco Selvi,Cosmin Paduraru,Edouard Leurent,Shariq Iqbal,Jean-Baptiste Lespiau,Alex Ahern, Thomas Köppe, Kevin Millikin,
Natureno. 7964 (2023): 257-263
加载更多
作者统计
合作学者
合作机构
D-Core
- 合作者
- 学生
- 导师
数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn