安德魯・巴托 Andrew Barto

科學家圖靈獎得主教授

關於安德魯・巴托 Andrew Barto

Andrew Barto 是強化學習領域的共同創始人，與學生 Richard Sutton 共同撰寫了經典教科書《Reinforcement Learning: An Introduction》。他在 1980 年代開創性地提出了時序差分學習的理論基礎，並與 Sutton 共同發明了 Actor-Critic 架構。2024 年，他與 Sutton 共同獲得圖靈獎，表彰他們對強化學習領域的開創性貢獻。

職涯經歷

2012-現在

資訊與電腦科學學院名譽教授

麻薩諸塞大學安默斯特分校

1977-2012

資訊與電腦科學學院教授

麻薩諸塞大學安默斯特分校

學歷

密西根大學

理學學士

1970

密西根大學

電腦與通訊科學碩士

1975

密西根大學

電腦與通訊科學博士

1975

重要論文

Neuronlike adaptive elements that can solve difficult learning control problems

IEEE Transactions on Systems, Man, and Cybernetics 1983

Reinforcement Learning: An Introduction

MIT Press 1998

Learning to predict by the methods of temporal differences

Machine Learning 1988

Intrinsically Motivated Reinforcement Learning

NIPS 2004

重要言論

強化學習是理解智慧的計算理論中最核心的一環
關於強化學習重要性的闡述

最讓我興奮的發現是 TD 學習與多巴胺神經元行為的相似性——這顯示我們可能觸及了大腦學習的真正機制
關於強化學習與神經科學的連結

成就與獎項

圖靈獎 (ACM A.M. Turing Award) 計算機協會 (ACM) (2024)

IEEE Neural Networks Pioneer Award IEEE 計算智慧學會 (2004)

IJCAI Award for Research Excellence 國際人工智慧聯合會議 (2017)

關於 安德魯・巴托 Andrew Barto

職涯經歷

資訊與電腦科學學院名譽教授

資訊與電腦科學學院教授

學歷

密西根大學

密西根大學

密西根大學

重要論文

Neuronlike adaptive elements that can solve difficult learning control problems

Reinforcement Learning: An Introduction

Learning to predict by the methods of temporal differences

Intrinsically Motivated Reinforcement Learning

重要言論

成就與獎項

關於安德魯・巴托 Andrew Barto