Robust machine learning, Multi-armed bandits, Deep neural network regression, GAN, SGD.
Seminars:
1. Titile: From Bandits to RL and to Deepseek-r1 via a Statistical View by Dr. Huiming Zhang
Time: 2025/04/27 20:00-22:00 (GMT+08:00) https://meeting.tencent.com/dm/ic5X6F6ol1HQ Code:529-914-489
Associate Professor
Supervisor of Master's Candidates
E-Mail:
Date of Employment:2022-11-22
School/Department:人工智能学院(人工智能研究院)
Education Level:博士研究生
Business Address:新主楼B1006
Gender:Male
Degree:博士
Status:Employed
Alma Mater:北京大学
The Last Update Time : ..