基本信息

赵冬斌,男、研究员、博导、Fellow IEEE,中国科学院自动化研究所
电子邮件: dongbin.zhao@ia.ac.cn
通信地址: 海淀区中关村东路95号智能化大厦1005
邮政编码: 100190

研究领域

智能学习控制:深度强化学习,自适应动态规划,强化学习,演化计算,智能游戏,自动机器学习
智能交通:智能驾驶,交通信号控制,车路协同
机器人:移动机器人感知与学习控制,机电一体化系统

招生信息

招生专业1:控制理论与控制工程--群体智能与博弈对抗

招生专业2:模式识别--人工智能理论与方法


招生方向
深度强化学习,自适应动态规划,强化学习,智能控制
智能驾驶,智能游戏,机器人,智能交通,能源管控
神经架构搜索,自动机器学习

教育背景

1996-09--2000-04   哈尔滨工业大学   博士
1994-09--1996-07   哈尔滨工业大学   硕士
1990-09--1994-07   哈尔滨工业大学   学士
出国学习工作
2007年8月-2008年8月,University of Arizona, 访问学者,国家留学基金委公派留学计划。

工作经历

   
工作简历
2014-01~2014-02,新加坡科技研究局, 访问学者
2012-11~现在, 中科院自动化所, 研究员、博导
2002-04~2012-10,中国科学院自动化研究所, 副研、硕导-博导
2000-05~2002-01,清华大学, 博士后
社会兼职
2019-12-11-2019-12-16,The 10th International Conference on Intelligent Control and Information Processing (ICICIP 2019), Marrakesh, Morocco, Program Chair
2019-12-06-2019-12-09,IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2019), Xiamen, China, Program Chair
2019-07-13-2019-07-18,IEEE International Joint Conference on Neural Networks (IJCNN 2019), Budapest, Hungary, Program Co-Chair
2019-05-04-2019-05-06,IEEE International Conference on Computational Intelligence for Financial Engineering and Economics (CIFEr 2019), Shenzhen, China, General Co-Chair
2019-01-01-2019-12-31,IEEE CIS Technical Activities Strategy Planning Sub-Committee, Chair
2018-12-01-2018-12-04,The 25th International Conference on Neural Information Processing (ICONIP 2018), Siem Reap, Cambodia, Dec 1-4, 2018, Tutorial Chair
2018-11-18-2018-11-21,IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2018), Bangalore, India, Nov. 18 -21, 2018, Program Chair
2018-09-01-2019-08-31,IEEE Computation Intelligence Magazine special issue on “Deep Reinforcement Learning and Games”., Lead Guest Chair
2018-06-29-2018-07-06,2018 Eighth International Conference on Information Science and Technology (ICIST 2018), Cordoba, Granada, and Seville, Spain during June 30-July 6, 2018, Program Chair
2018-06-01-今,IEEE Transactions on Neural Networks and Learning Systems special issue on “Deep Reinforcement Learning and Adaptive Dynamic Programming”, Lead Guest Editor
2018-03-01-今,IEEE Transactions on Cybernetics, Associate Editor
2017-11-26-2017-11-30,IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2017), Honolulu, Hawaii, USA, Program Chair
2017-11-13-2017-11-17,The 24th International Conference on Neural Information Processing (ICONIP 2017), Guangzhou, China, Program Chair
2017-07-05-2017-07-27,2017 IEEE CIS Summer School on Computational and Artificial Intelligence, Chair
2017-01-01-今,IEEE计算智能学会北京分会, 主席
2016-12-05-2016-12-08,IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2016), Athens, Greece, Program Chair
2016-07-25-2017-07-29,IEEE World Congress on Computational Intelligence (WCCI 2016), Vancouver, Canada, Publicity Co-chair
2016-06-11-2016-06-14,The 13th World Congress on Intelligent Control and Automation (WCICA 2016), Guilin, China, Program Co-Chair
2015-10-15-2015-10-18,12th International Symposium on Neural Networks (ISNN 2015), Jeju, Korea, Program Co-Chair
2015-04-24-2015-04-26,The 5th International Conference on Information Science and Technology (ICIST 2015), Changsha, China, Program Chair
2015-01-01-今,Artificial Intelligence Review, Associate Editor
2014-12-31-2016-12-31,IEEE计算智能学会自适应动态规划和强化学习技术委员会, 主席
2014-12-31-2015-12-31,IEEE计算智能学会旅行资助委员会, 主席
2014-12-31-2016-12-31,IEEE计算智能学会多媒体委员会, 主席
2014-12-31-2016-12-31,IEEE计算智能学会北京分会, 副主席
2014-12-09-2014-12-12,IEEE Symposiums Series on Computational Intelligence (SSCI 2014), Atlanta, USA, Poster Chair
2014-07-06-2014-07-11,IEEE World Congress on Computational Intelligence (WCCI 2014), Beijing, China, Finance Co-Chair
2014-07-06-2014-07-11,IEEE CIS Summer School on Automated Computational Intelligence, Beijing, China, Chair
2014-01-01-今,IEEE Computational Intelligence Magazine, Associate Editor,
2013-06-09-2013-06-11,The 4th International Conference on Intelligent Control and Information Processing (ICICIP 2013), Beijing, China, Program Chair
2012-12-31-2014-12-30,IEEE CIS Newsletter, Editor,
2012-07-11-2012-07-14,International Symposium on Neural Networks (ISNN 2012), Shenyang, China, Registration Chair
2012-07-11-2012-07-14,Brain Inspired Cognitive Systems (BICS 2012), Shenyang, China, Finance Chair
2012-01-01-今,IEEE Transactions on Neural Networks and Learning Systems, Associate Editor
2011-11-01-今,Cognitive Computation, Associate Editor,
2010-10-01-今,IEEE高级会员,

教授课程

强化学习
智能控制
智能控制理论基础及应用

专利与奖励

   
奖励信息
(1) 2017年度IEEE Transactions on Cognitive and Developmental Systems优秀论文奖(唯一), , 其他, 2019
(2) IEEE Fellow, , 其他, 2019
(3) 《控制理论与应用》优秀编委, , 其他, 2019
(4) 中国人工智能学会优秀博士学位论文指导教师, , 部委级, 2019
(5) 2019年中国AI+创新创业大赛, 一等奖, 部委级, 2019
(6) IJCNN 2018 Best Student Paper Final List, 其他, 2018
(7) 《控制理论与应用》优秀论文奖, 其他, 2018
(8) 前方车辆距离监测第1名,2017年中国智能车未来挑战赛—复杂交通环境认知基础能力离线测试比赛, 一等奖, 国家级, 2017
(9) 前方车辆检测第1名,2017年中国智能车未来挑战赛—复杂交通环境认知基础能力离线测试比赛, 一等奖, 国家级, 2017
(10) 基于数据的非线性系统自学习最优控制理论与方法, 三等奖, 部委级, 2015
(11) 中国科学院“朱李月华优秀教师”奖, , 院级, 2014
(12) 中国石油和化工自动化应用协会科技进步一等奖, 一等奖, 部委级, 2012
(13) 北京市科学技术奖, 三等奖, 省级, 2010
(14) 中国石油和化学工业协会科技进步三等奖, 三等奖, 部委级, 2009
专利成果
[1] 朱圆恒, 柴嘉骏, 赵冬斌. 一种面向空战一对一博弈对抗的飞行器控制方法及装置. CN: CN116699978A, 2023-09-05.
[2] 陈亚冉, 赵冬斌, 张启超. 基于光流和卡尔曼滤波的多目标追踪方法、系统、装置. CN: CN110415277B, 2022-03-08.
[3] 赵冬斌, 唐振韬, 朱圆恒. 多机器人协作对抗方法、装置、电子设备和存储介质. CN: CN113894780A, 2022-01-07.
[4] 朱圆恒, 柴嘉骏, 李伟凡, 赵冬斌, 孙科武, 丁季时雨, 董博. 基于强化学习的变化环境多智能体控制方法与装置. CN: CN113837348A, 2021-12-24.
[5] 朱圆恒, 李伟凡, 熊华, 赵冬斌. 一种基于强化学习的导弹制导方法和装置. CN: CN113239472A, 2021-08-10.
[6] 朱圆恒, 赵冬斌. 基于加速度前馈的异构车队协同自适应巡航控制方法. CN: CN110888322B, 2021-04-13.
[7] 李浩然, 张启超, 赵冬斌. 面向地铁视觉图像的轨道检测方法及系统. CN: CN111611956A, 2020-09-01.
[8] 陈亚冉, 赵晓东, 赵冬斌. 面向智能驾驶的移动目标轨迹预测方法、系统、装置. CN: CN111597961A, 2020-08-28.
[9] 赵冬斌, 邵坤, 朱圆恒. 基于反事实回报的多智能体深度强化学习方法、系统. CN: CN111105034A, 2020-05-05.
[10] 陈亚冉, 赵冬斌, 张启超. 基于光流和卡尔曼滤波的多目标追踪方法、系统、装置. CN: CN110415277A, 2019-11-05.
[11] 张启超, 王俊杰, 赵冬斌. 智能驾驶横向换道决策方法、系统和装置. CN: CN110304045A, 2019-10-08.
[12] 赵冬斌, 李栋, 张启超, 陈亚冉, 朱圆恒. 智能驾驶车道保持方法及系统. CN: CN109466552A, 2019-03-15.
[13] 朱圆恒, 赵冬斌, 李相俊, 周建华. 多电池储能系统的优化控制方法、系统及存储介质. CN: CN109245196A, 2019-01-18.
[14] 赵冬斌, 陈亚冉. 面向驾驶辅助系统的危险目标检测方法、装置. CN: CN107609483A, 2018-01-19.
[15] 赵冬斌, 张启超, 夏中谱. 驾驶员跟车行为分析中的期望跟车距离计算方法. CN: CN107016193A, 2017-08-04.
[16] 赵冬斌, 卜丽, 朱圆恒, 李相俊. 储能电池充/放电异常行为检测方法及检测系统. CN: CN106154180A, 2016-11-23.
[17] 赵冬斌, 张震, 刘德荣. 一种基于稀疏强化学习的传感器网络优化方法. CN: CN103702349A, 2014-04-02.
[18] Zhao, Dongbin. ADAPTIVE CRUISE CONTROL SYSTEM AND METHOD FOR VEHICLE. CN: US20140012479(A1), 2014-01-09.
[19] 赵冬斌, 王滨, 刘德荣. 基于监督式强化学习的最优控制方法. CN: CN103324085A, 2013-09-25.
[20] 赵冬斌, 朱圆恒, 刘德荣. 基于数据的Q函数自适应动态规划方法. CN: CN103217899A, 2013-07-24.
[21] 刘德荣, 魏庆来, 黄玉柱, 赵冬斌. 变换炉的控制方法. CN: CN102830628A, 2012-12-19.
[22] 赵冬斌, 王滨, 刘德荣, 魏庆来, 朱圆恒, 苏永生. 煤气化炉的控制方法. CN: CN102799748A, 2012-11-28.
[23] 赵冬斌, 朱圆恒. 模糊自适应动态规划方法. CN: CN102645894A, 2012-08-22.
[24] 赵冬斌. 车辆自适应巡航控制系统及方法. CN: CN102109821A, 2011-06-29.
[25] 赵冬斌, 李涛, 易建强. 街区路口交通信号优化控制方法. CN: CN101789178A, 2010-07-28.
[26] 赵冬斌, 李涛, 易建强, 张建宏. 单配重式自动水平调节吊具及使用方法. CN: CN101759092A, 2010-06-30.
[27] 易建强, 余 意, 赵冬斌, 张建宏. 绳索牵引自动水平调节吊具及方法. CN: CN101633478A, 2010-01-27.
[28] 易建强, 项炎平, 赵冬斌. 一种双旋配重式自动水平调节吊具系统及控制方法. CN: CN101468776A, 2009-07-01.
[29] 赵冬斌, 徐 冬, 易建强, 张小成. 一种极坐标方式水平自动调节吊具及方法. CN: CN101450767A, 2009-06-10.
[30] 易建强, 项炎平, 赵冬斌. 一种自动化立体仓库框架结构. CN: CN101407271A, 2009-04-15.
[31] 易建强, 张小成, 赵冬斌, 徐 冬. 一种正交式水平自动调节吊具及方法. CN: CN101397114A, 2009-04-01.
[32] 刘伟荣, 易建强, 赵冬斌. 一种应用于互联网的网络拥塞控制系统及方法. CN: CN101166140A, 2008-04-23.
[33] 易建强, 赵冬斌, 李新春, 邓旭玥, 李佳宁. 一种移动机械手控制系统. CN: CN100361792C, 2008-01-16.
[34] 谭湘敏, 易建强, 赵冬斌. 一种移动机器人的位姿传感系统及其方法. CN: CN101049697A, 2007-10-10.
[35] 赵冬斌, 易建强. 火灾救援机器人系统及其方法. CN: CN1994495A, 2007-07-11.
[36] 赵冬斌, 易建强. 火灾抢险机器人系统及其方法. CN: CN1978004A, 2007-06-13.
[37] 赵冬斌, 易建强. 火灾救援机器人系统. CN: CN2889642Y, 2007-04-18.
[38] 易建强, 洪义平, 赵冬斌. 机器人视觉导航中的自然目标检测方法. CN: CN1873656A, 2006-12-06.
[39] 易建强, 洪义平, 赵冬斌. 鲁棒的自然图像分割方法. CN: CN1873657A, 2006-12-06.
[40] 赵冬斌, 易建强. 转球式洗衣机及方法. CN: CN1869315A, 2006-11-29.
[41] 易建强, 钟志光, 赵冬斌. 一种结合传感技术的射频卡门禁系统. CN: CN1866301A, 2006-11-22.
[42] 易建强, 钟志光, 赵冬斌. 一种基于射频技术的病理监测系统. CN: CN1853556A, 2006-11-01.
[43] 易建强, 赵冬斌, 程 金. 一种自动舵航向控制系统及其方法. CN: CN1800782A, 2006-07-12.
[44] 易建强, 赵冬斌, 程 金. 一种航迹自动舵控制系统及其方法. CN: CN1800781A, 2006-07-12.
[45] 易建强, 赵冬斌. 线材自动点数机. CN: CN1778639A, 2006-05-31.
[46] 易建强, 赵冬斌, 李新春, 邓旭玥, 李佳宁. 一种移动机械手控制系统. CN: CN2747031Y, 2005-12-21.
[47] 易建强, 赵冬斌. 线材自动点数机. CN: CN2743236Y, 2005-11-30.
[48] 易建强, 洪义平, 赵冬斌. 门牌号自动识别系统及方法. CN: CN1684096A, 2005-10-19.
[49] 易建强, 洪义平, 赵冬斌. 一种实时IC卡数字字符识别与校验系统及方法. CN: CN1684097A, 2005-10-19.
[50] 易建强, 刘殿通, 赵冬斌. 吊车全自动控制系统. CN: CN1613747A, 2005-05-11.
[51] 易建强, 刘殿通, 赵冬斌. 吊车半自动控制系统. CN: CN1613746A, 2005-05-11.
[52] 钟志光, 易建强, 赵冬斌. 多用途指纹识别保存柜系统. CN: CN1614621A, 2005-05-11.
[53] 钟志光, 易建强, 赵冬斌. 含指纹识别钥匙柜的网络指纹门禁系统. CN: CN1612150A, 2005-05-04.
[54] 赵冬斌, 易建强. 桌上曲棍球机器人系统. CN: CN1611282A, 2005-05-04.
[55] 赵冬斌, 易建强, 宋佐时, 邓旭玥. 移动机械手系统. CN: CN1611331A, 2005-05-04.
[56] 易建强, 钟志光, 赵冬斌. 银行保管箱系统. CN: CN1570340A, 2005-01-26.
[57] 易建强, 刘殿通, 赵冬斌. 吊车全自动控制系统. CN: CN2663387Y, 2004-12-15.
[58] 易建强, 刘殿通, 赵冬斌. 吊车半自动控制系统. CN: CN2659859Y, 2004-12-01.
[59] 赵冬斌, 易建强. 桌上曲棍球机器人系统. CN: CN2649274Y, 2004-10-20.
[60] 赵冬斌, 易建强, 宋佐时, 邓旭玥. 移动机械手系统. CN: CN2645862Y, 2004-10-06.

出版信息


发表论文
[1] Chai, Jiajun, Chen, Wenzhang, Zhu, Yuanheng, Yao, ZongXin, Zhao, Dongbin. A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2023, 53(9): 5417-5429, http://dx.doi.org/10.1109/TSMC.2023.3270444.
[2] Chai, Jiajun, Li, Weifan, Zhu, Yuanheng, Zhao, Dongbin, Ma, Zhe, Sun, Kewu, Ding, Jishiyu. UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2023, 34(4): 2093-2104, http://dx.doi.org/10.1109/TNNLS.2021.3105869.
[3] Li, Nannan, Chen, Yaran, Li, Weifan, Ding, Zixiang, Zhao, Dongbin, Nie, Shuai. BViT: Broad Attention-Based Vision Transformer. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2023, http://dx.doi.org/10.1109/TNNLS.2023.3264730.
[4] 卢毅, 陈亚冉, 赵冬斌, 刘暴, 来志超, 王超楠. 关键点图对比图像分类方法. 智能系统学报[J]. 2023, 18(1): 36-46, http://sciencechina.cn/gw.jsp?action=detail.jsp&internal_id=7442151&detailType=1.
[5] Hu, Guangzheng, Li, Haoran, Liu, Shasha, Zhu, Yuanheng, Zhao,Dongbin. NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks. 2023 International Joint Conference on Neural Networks(IJCNN)null. 2023, [6] Ding, Zixiang, Chen, Yaran, Li, Nannan, Zhao, Dongbin, Sun, Zhiquan, Chen, C L Philip. BNAS: Efficient Neural Architecture Search Using Broad Scalable Architecture. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2022, 33(9): 5004-5018, http://dx.doi.org/10.1109/TNNLS.2021.3067028.
[7] Zhang, Qichao, Gao, Yinfeng, Zhang, Yikang, Guo, Youtian, Ding, Dawei, Wang, Yunpeng, Sun, Peng, Zhao, Dongbin. TrajGen: Generating Realistic and Diverse Trajectories With Reactive and Feasible Agent Behaviors for Autonomous Driving. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS[J]. 2022, 23(12): 24474-24487, http://dx.doi.org/10.1109/TITS.2022.3202185.
[8] 刘民颂, 李论通, 邵帅, 朱圆恒, 赵冬斌. Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning. Ieee transactions on cognitive and developmental systems[J]. 2022, [9] Zhu, Yuanheng, Li, Weifan, Zhao, Mengchen, Hao, Jianye, Zhao, Dongbin. Empirical Policy Optimization for n-Player Markov Games. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2022, http://dx.doi.org/10.1109/TCYB.2022.3179775.
[10] Zhu, Yuanheng, Zhao, Dongbin. Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2022, 33(3): 1228-1241, http://dx.doi.org/10.1109/TNNLS.2020.3041469.
[11] Li, Nannan, Pan, Yu, Chen, Yaran, Ding, Zixiang, Zhao, Dongbin, Xu, Zenglin. Heuristic rank selection with progressively searching tensor ring network. COMPLEX & INTELLIGENT SYSTEMS[J]. 2022, 8(2): 771-785, http://dx.doi.org/10.1007/s40747-021-00308-x.
[12] Ding, Zixiang, Chen, Yaran, Li, Nannan, Zhao, Dongbin. BNAS-v2: Memory-Efficient and Performance-Collapse-Prevented Broad Neural Architecture Search. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2022, 52(10): 6259-6272, http://dx.doi.org/10.1109/TSMC.2022.3143201.
[13] Li, Haoran, Chen, Yaran, Zhang, Qichao, Zhao, Dongbin. BiFNet: Bidirectional Fusion Network for Road Segmentation. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2022, 52(9): 8617-8628, http://dx.doi.org/10.1109/TCYB.2021.3105488.
[14] 唐振韬, 梁荣钦, 朱圆恒, 赵冬斌. 实时格斗游戏的智能决策方法. 控制理论与应用. 2022, 39(6): 969-985, https://d.wanfangdata.com.cn/periodical/kzllyyy202206001.
[15] Wang, Junjie, Zhang, Qichao, Zhao, Dongbin. Highway Lane Change Decision-Making via Attention-Based Deep Reinforcement Learning. IEEE-CAA JOURNAL OF AUTOMATICA SINICAnull. 2022, 9(3): 567-569, http://dx.doi.org/10.1109/JAS.2021.1004395.
[16] Li, Haoran, Duan, Zicheng, Li, Jiaqi, Ma, Mingjun, Chen, Yaran, Zhao, Dongbin. Neurons Perception Dataset for RoboMaster AI Challenge. 2022 IEEE World Congress on Computational Intelligence (WCCI)null. 2022, [17] Li, Weifan, Zhu, Yuanheng, Zhao, Dongbin. Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target. COMPLEX & INTELLIGENT SYSTEMS[J]. 2022, 8(2): 1205-1216, http://dx.doi.org/10.1007/s40747-021-00577-6.
[18] Lu, Yi, Chen, Yaran, Zhao, Dongbin, Liu, Bao, Lai, Zhichao, Chen, Jianxin. CNN-G: Convolutional Neural Network Combined With Graph for Image Segmentation With Theoretical Analysis. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS[J]. 2021, 13(3): 631-644, http://dx.doi.org/10.1109/TCDS.2020.2998497.
[19] Chai, Jiajun, Li, Weifan, Zhu, Yuanheng, Zhao, Dongbin, Ma, Zhe, Sun, Kewu, Ding, Jishiyu. UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2021, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000733450200001.
[20] Li, Haoran, Chen, Yaran, Zhang, Qichao, Zhao, Dongbin. BiFNet: Bidirectional Fusion Network for Road Segmentation. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2021, http://dx.doi.org/10.1109/TCYB.2021.3105488.
[21] Zhu, Yuanheng, Zhao, Dongbin, He, Haibo. Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING[J]. 2021, 18(3): 1097-1108, http://dx.doi.org/10.1109/TASE.2020.2996018.
[22] Li, Weifan, Zhu, Yuanheng, Zhao, Dongbin. Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target. COMPLEXINTELLIGENTSYSTEMS[J]. 2021, 12-, [23] Lu, Yi, Chen, Yaran, Zhao, Dongbin, Li, Dong. MGRL: Graph neural network based inference in a Markov network with reinforcement learning for visual navigation. NEUROCOMPUTING[J]. 2021, 421: 140-150, http://dx.doi.org/10.1016/j.neucom.2020.07.091.
[24] Ding, Zixiang, Chen, Yaran, Li, Nannan, Zhao, Dongbin, Sun, Zhiquan, Chen, C L Philip. BNAS: Efficient Neural Architecture Search Using Broad Scalable Architecture. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS. 2021, http://dx.doi.org/10.1109/TNNLS.2021.3067028.
[25] Hu, Guangzheng, Zhu, Yuanheng, Zhao, Dongbin, Zhao, Mengchen, Hao, Jianye. Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2021, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000732283100001.
[26] Mu, Chaoxu, Wang, Ke, Zhang, Qichao, Zhao, Dongbin. Hierarchical optimal control for input-affine nonlinear systems through the formulation of Stackelberg game. INFORMATION SCIENCES[J]. 2020, 517: 1-17, http://dx.doi.org/10.1016/j.ins.2019.12.078.
[27] Li, Haoran, Zhang, Qichao, Zhao, Dongbin. Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2020, 31(6): 2064-2076, http://dx.doi.org/10.1109/TNNLS.2019.2927869.
[28] Shao, Kun, Zhu, Yuanheng, Tang, Zhentao, Zhao, Dongbin, IEEE. Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2020, [29] Liu, Minsong, Zhu, Yuanheng, Zhao, Dongbin, IEEE. An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2020, [30] Zhu, Yuanheng, Zhao, Dongbin, He, Haibo. Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2020, 50(11): 3959-3971, http://dx.doi.org/10.1109/TSMC.2019.2911900.
[31] Zhao Dongbin. Advances in Deep Neural Information Processing - Editorial. Neurocomputing. 2020, [32] Zhao, Dongbin, Duan, Shukai, Yan, Zheng, Alippi, Cesare. Advances in deep neural information processing. NEUROCOMPUTING. 2020, 408: 80-81, http://dx.doi.org/10.1016/j.neucom.2020.01.001.
[33] Zhu, Yuanheng, He, Haibo, Zhao, Dongbin. LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS[J]. 2020, 21(11): 4516-4525, https://www.webofscience.com/wos/woscc/full-record/WOS:000587709700003.
[34] Zhu, Yuanheng, Zhao, Dongbin, He, Haibo. Synthesis of Cooperative Adaptive Cruise Control With Feedforward Strategies. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY[J]. 2020, 69(4): 3615-3627, https://www.webofscience.com/wos/woscc/full-record/WOS:000530284400009.
[35] Zhao Dongbin. A spatial-temporal LSTM model for human trajectory prediction. IEEE/CAA Journal of Automation Sinica. 2020, [36] Zhao, Xiaodong, Chen, Yaran, Guo, Jin, Zhao, Dongbin. A spatial-temporal attention model for human trajectory prediction. IEEE-CAA JOURNAL OF AUTOMATICA SINICA[J]. 2020, 7(4): 965-974, http://dx.doi.org/10.1109/JAS.2020.1003228.
[37] Wang, Xu, Liu, Jingwei, Wu, Chaoyong, Liu, Junhong, Li, Qianqian, Chen, Yufeng, Wang, Xinrong, Chen, Xinli, Pang, Xiaohan, Chang, Binglong, Lin, Jiaying, Zhao, Shifeng, Li, Zhihong, Deng, Qingqiong, Lu, Yi, Zhao, Dongbin, Chen, Jianxin. Artificial intelligence in tongue diagnosis: Using deep convolutional neural network for recognizing unhealthy tongue with tooth-mark. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL[J]. 2020, 18: 973-980, https://doaj.org/article/dd1c9f66bc8e44049bdda09f7ddac7ce.
[38] Lu, Yi, Chen, Yaran, Zhao, Dongbin, Chen, Jianxin, Lu, H, Tang, H, Wang, Z. Graph-FCN for Image Semantic Segmentation. ADVANCES IN NEURAL NETWORKS - ISNN 2019, PT Inull. 2019, 11554: 97-105, [39] Zhu, Yuanheng, Zhao, Dongbin, Li, Xiangjun, Wang, Ding. Control-Limited Adaptive Dynamic Programming for Multi-Battery Energy Storage Systems. IEEE TRANSACTIONS ON SMART GRID[J]. 2019, 10(4): 4235-4244, https://www.webofscience.com/wos/woscc/full-record/WOS:000472577500065.
[40] Zhao Dongbin. Adaptive optimal control of cooperative adaptive cruise control with uncertain heterogeneous vehicles. IEEE Control System Technology. 2019, [41] Shao, Kun, Zhu, Yuanheng, Zha, Dongbin. StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE[J]. 2019, 3(1): 73-84, http://dx.doi.org/10.1109/TETCI.2018.2823329.
[42] Gao, Yinfeng, Liu, Yuqi, Zhang, Qichao, Wang, Yu, Zhao, Dongbin, Ding, Dawei, Pang, Zhonghua, Zhang, Yueming, IEEE. Comparison of Control Methods Based on Imitation Learning for Autonomous Driving. 2019 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP)null. 2019, 274-281, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000613247000046.
[43] Zhu, Yuanheng, He, Haibo, Zhao, Dongbin, Hou, Zhongsheng, IEEE. Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2019, [44] Su, Hao, Chen, Yaran, Tong, Shiwen, Zhao, Dongbin, IEEE. Real-time multiple object tracking based on optical flow. 2019 9TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2019)null. 2019, 350-356, [45] Chen, Yaran, Zhao, Dongbin, Li, Haoran, IEEE. Deep Kalman Filter with Optical Flow for Multiple Object Tracking. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC)null. 2019, 3036-3041, [46] Lv, Le, Zhao, Dongbin, Shao, Kun. Deep sparse representation-based mid-level visual elements discovery in fine-grained classification. SOFT COMPUTING[J]. 2019, 23(18): 8711-8722, http://dx.doi.org/10.1007/s00500-018-3468-3.
[47] Li, Dong, Zhao, Dongbin, Zhang, Qichao, Chen, Yaran. Reinforcement Learning and Deep Learning Based Lateral Control for Autonomous Driving. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE[J]. 2019, 14(2): 83-98, http://ir.ia.ac.cn/handle/173211/23517.
[48] Wang, Bin, Zhao, Dongbin, Cheng, Jin. Adaptive cruise control via adaptive dynamic programming with experience replay. SOFT COMPUTING[J]. 2019, 23(12): 4131-4144, http://ir.ia.ac.cn/handle/173211/24396.
[49] Zhang, Qichao, Zhao, Dongbin. Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2019, 49(8): 2874-2885, http://ir.ia.ac.cn/handle/173211/24567.
[50] Wang, Junjie, Zhang, Qichao, Zhao, Dongbin, Chen, Yaran, IEEE. Lane Change Decision-making through Deep Reinforcement Learning with Rule-based Constraints. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2019, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000530893803042.
[51] Zhu, Yuanheng, Zhao, Dongbin, Zhong, Zhiguang. Adaptive Optimal Control of Heterogeneous CACC System With Uncertain Dynamics. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY[J]. 2019, 27(4): 1772-1779, [52] Zhang, Qichao, Luo, Rui, Zhao, Dongbin, Luo, Chaomin, Qian, Dianwei, IEEE. Model-Free Reinforcement Learning based Lateral Control for Lane Keeping. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2019, [53] Tang Zhentao, Shao Kun, Zhu Yuanheng, Li Dong, Zhao Dongbin, Huang Tingwen, Sundaram S. A Review of Computational Intelligence for StarCraft AI. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI)null. 2018, 1167-1173, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000459238800159.
[54] Zhao, Xiaodong, Zhang, Qichao, Zhao, Dongbin, Pang, Zhonghua, Sun, MX, Zhang, HG. Overview of Image Segmentation and Its Application on Free Space Detection. PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS)null. 2018, 1164-1169, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000450645900210.
[55] Chen, Yaran, Zhao, Dongbin, Lv, Le, Zhang, Qichao. Multi-task learning for dangerous object detection in autonomous driving. INFORMATION SCIENCES[J]. 2018, 432(*): 559-571, http://dx.doi.org/10.1016/j.ins.2017.08.035.
[56] Zhao Dongbin, Li Haoran, Li Dong, Guo Ping, Chen Yaran. A Temporal-based Deep Learning Method for Multiple Objects Detection in Autonomous Driving. 2018, http://ir.ia.ac.cn/handle/173211/23521.
[57] Zhu, Yuanheng, Zhao, Dongbin. Comprehensive comparison of online ADP algorithms for continuous-time optimal control. ARTIFICIAL INTELLIGENCE REVIEW[J]. 2018, 49(4): 531-547, https://www.webofscience.com/wos/woscc/full-record/WOS:000426912500004.
[58] Zhang, Zhen, Wang, Dongqing, Zhao, Dongbin, Han, Qiaoni, Song, Tingting. A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents. IEEE ACCESS[J]. 2018, 6: 70223-70235, http://ir.ia.ac.cn/handle/173211/25665.
[59] Zhao, Dongbin, Liu, Derong, Lewis, F L, Principe, Jose C, Squartini, Stefano. Special Issue on Deep Reinforcement Learning and Adaptive Dynamic Programming. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS. 2018, 29(6): 2038-2041, https://www.webofscience.com/wos/woscc/full-record/WOS:000432398300001.
[60] Li Dong, Zhao Dongbin, Zhang Qichao, Zhu Yuanheng, Sundaram S. An Autonomous Driving Experience Platform with Learning-Based Functions. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI)null. 2018, 1174-1179, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000459238800160.
[61] Yuanheng Zhu, Nannan Li, Kun Shao, Dongbin Zhao. Learning battles in ViZDoom via deep reinforcement learning. 2018, http://ir.ia.ac.cn/handle/173211/23364.
[62] Zhang, Qichao, Zhao, Dongbin, Lewis, Frank L, IEEE. Model-Free Reinforcement Learning for Fully Cooperative Multi-Agent Graphical Games. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2018, [63] Zhu, Yuanheng, Zhao, Dongbin, Yang, Xiong, Zhang, Qichao. Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2018, 48(2): 500-509, https://www.webofscience.com/wos/woscc/full-record/WOS:000422925700005.
[64] Zhang, Qichao, Zhao, Dongbin, Wang, Ding. Event-Based Robust Control for Uncertain Nonlinear Systems Using Adaptive Dynamic Programming. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2018, 29(1): 37-50, https://www.webofscience.com/wos/woscc/full-record/WOS:000419558900004.
[65] Shao, Kun, Zhao, Dongbin, Zhu, Yuanheng, Zhang, Qichao, IEEE. Visual Navigation with Actor-Critic Deep Reinforcement Learning. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2018, [66] Chen, Yaran, Zhao, Dongbin, Li, Haoran, Li, Dong, Guo, Ping, IEEE. A temporal-based deep learning method for multiple objects detection in autonomous driving. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2018, [67] Bu, Li, Alippi, Cesare, Zhao, Dongbin. A pdf-Free Change Detection Test Based on Density Difference Estimation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2018, 29(2): 324-334, https://www.webofscience.com/wos/woscc/full-record/WOS:000422952400007.
[68] Wu, I C, Lee, C S, Tian, Y, Mueller, M. Guest Editorial Special Issue on Deep/Reinforcement Learning and Games. IEEE TRANSACTIONS ON GAMESnull. 2018, 10(4): 333-335, https://www.webofscience.com/wos/woscc/full-record/WOS:000453577300001.
[69] Zhao Dongbin. Comprehesive comparison of online ADP algorithms for continuous-time optimal control. Artificial Intelligence Review. 2018, [70] Li, Dong, Zhao, Dongbin, Chen, Yaran, Zhang, Qichao, IEEE. DeepSign: Deep Learning based Traffic Sign Recognition. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2018, [71] Lu, Yi, Chen, Yaran, Zhao, Dongbin, Li, Haoran, IEEE. Hybrid Deep Learning Based Moving Object Detection via Motion prediction. 2018 CHINESE AUTOMATION CONGRESS (CAC)null. 2018, 1442-1447, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000459239501089.
[72] Zhu Yuanheng, Zhang Qichao, Zhao Dongbin, Li Dong. An Autonomous Driving Experience Platform with Learning-Based Functions. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI)null. 2018, 1174-1179, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000459238800160.
[73] Yuanheng Zhu, Qichao Zhang, Dongbin Zhao, Kun Shao. Visual navigation with Actor-Critic deep reinforcement learning. 2018, http://ir.ia.ac.cn/handle/173211/23365.
[74] Zhu, Yuanheng, Zhao, Dongbin, He, Haibo, Ji, Junhong. Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS[J]. 2017, 64(5): 4101-4109, https://www.webofscience.com/wos/woscc/full-record/WOS:000399674000064.
[75] Deng QingQiong, Zhao Dongbin, Lv Le. Image Clustering based on Deep Sparse Representations. 2016 IEEE Symposium Series on Computational Intelligence: SSCI 2016, Athens, Greece, 6-9 December 2016, pages 2037-2712, v.4null. 2017, 2108-2113, http://ir.ia.ac.cn/handle/173211/14471.
[76] Bu Li, Zhao Dongbin, Alippi Cesare. An Incremental Change Detection Test Based on Density Difference Estimation. IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS[J]. 2017, [77] Li, Chengdong, Ding, Zixiang, Zhao, Dongbin, Yi, Jianqiang, Zhang, Guiqing. Building Energy Consumption Prediction: An Extreme Deep Learning Approach. ENERGIES[J]. 2017, 10(10): https://doaj.org/article/97e10cd1f86645f384b67cc9b9f33881.
[78] Zhang, Qichao, Zhao, Dongbin, Zhu, Yuanheng. Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs. NEUROCOMPUTING[J]. 2017, 238(*): 377-386, http://dx.doi.org/10.1016/j.neucom.2017.01.076.
[79] Zhao, Dongbin, Chen, Yaran, Lv, Le. Deep Reinforcement Learning With Visual Attention for Vehicle Classification. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS[J]. 2017, 9(4): 356-367, http://dx.doi.org/10.1109/TCDS.2016.2614675.
[80] Li Dong, Zhao Dongbin, Zhang Qichao, Luo Chaomin, IEEE. Policy Gradient Methods with Gaussian Process Modelling Acceleration. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2017, 1774-1779, [81] Qichao Zhang, Haoran Li, Dongbin Zhao. Comparison of methods to efficient graph SLAM under general optimization framework. YAC 2017null. 2017, *-, http://ir.ia.ac.cn/handle/173211/19422.
[82] Zhao Dongbin. Editorial: new developments in neural network structures for signal processing, autonomous decision, and adaptive controll. IEEE Transactions on Neural Networks and Learning Systems. 2017, [83] 朱圆恒, 赵冬斌, 邵坤. Cooperative Reinforcement Learning for Multiple Units Combat in StarCraft. 2017, http://ir.ia.ac.cn/handle/173211/15399.
[84] Tang Zhentao, Lv Le, Shao Kun, Zhao Dongbin. ADP with MCTS algorithm for Gomoku. 2017, http://ir.ia.ac.cn/handle/173211/14475.
[85] Zhao Dongbin, Wei Qinglai, Alippi Cesare, Bu Li. A Kolmogorov-Smirnov Test to Detect Changes in Stationarity in Big Data. IFAC PAPERSONLINEnull. 2017, 50(1): 14260-14265, http://dx.doi.org/10.1016/j.ifacol.2017.08.1821.
[86] Zhu, Yuanheng, Zhao, Dongbin, Li, Xiangjun. Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2017, 28(3): 714-725, https://www.webofscience.com/wos/woscc/full-record/WOS:000395980500020.
[87] 唐振韬, 邵坤, 赵冬斌, 朱圆恒. 深度强化学习进展: 从 AlphaGo 到 AlphaGo Zero. 控 制 理 论 与 应 用[J]. 2017, 34(12): 1529-1546, http://lib.cqvip.com/Qikan/Article/Detail?id=7000480876.
[88] Zhang Qichao, Zhao Dongbin, Zhu Yuanheng. Event-Triggered $H_\\infty $ Control for Continuous-Time Nonlinear System via Concurrent Learning. IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS[J]. 2017, [89] Lv, Le, Zhao, Dongbin, Deng, Qingqiong. A Semi-Supervised Predictive Sparse Decomposition Based on Task-Driven Dictionary Learning. COGNITIVE COMPUTATION[J]. 2017, 9(1): 115-124, https://www.webofscience.com/wos/woscc/full-record/WOS:000394418100008.
[90] Zhao, Dongbin, Xia, Zhongpu, Zhang, Qichao. Model-free Optimal Control based Intelligent Cruise Control with Hardware-in-the-loop Demonstration. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE[J]. 2017, 12(2): 56-69, http://dx.doi.org/10.1109/MCI.2017.2670463.
[91] Zhao Dongbin. Event-triggered optimal control for nonlinear constrained-input systems with partially unknown dynamics via adaptive dynamic programming. IEEE Transactions on Industrial Electronics. 2017, [92] Shengli Xie, Derong Liu, Dongbin Zhao, ElSayed M ElAlfy, Yuanqing Li. Neural Information Processing. NEURALINFORMATIONPROCESSINGLECTURENOTESINCOMPUTERSCIENCE. 2017, 10636, 10637, 10638, 10639,-, http://ir.ia.ac.cn/handle/173211/19892.
[93] Chen, Yaran, Zhao, Dongbin, Cong, F, Leung, A, Wei, Q. Multi-task Learning with Cartesian Product-Based Multi-objective Combination for Dangerous Object Detection. ADVANCES IN NEURAL NETWORKS, PT Inull. 2017, 10261: 28-35, [94] Zhao Dongbin, Zhang Qichao. Data-driven adaptive dynamic programming for two-player nonzero-sum game. 2017, http://ir.ia.ac.cn/handle/173211/14342.
[95] Bu, Li, Zhao, Dongbin, Alippi, Cesare. An Incremental Change Detection Test Based on Density Difference Estimation. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2017, 47(10): 2714-2726, https://www.webofscience.com/wos/woscc/full-record/WOS:000411098200009.
[96] Zhang, Qichao, Zhao, Dongbin, Zhu, Yuanheng. Event-Triggered H-infinity Control for Continuous-Time Nonlinear System via Concurrent Learning. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2017, 47(7): 1071-1081, https://www.webofscience.com/wos/woscc/full-record/WOS:000404354600004.
[97] Zhang, Zhen, Zhao, Dongbin, Gao, Junwei, Wang, Dongqing, Dai, Yujie. FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2017, 47(6): 1367-1379, http://dx.doi.org/10.1109/TCYB.2016.2544866.
[98] Li Dong, Zhao Dongbin, Zhang Qichao, Luo Chaomin, IEEE. Policy Gradient Methods with Gaussian Process Modelling Acceleration. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2017, 1774-1779, [99] Wang, Ding, Liu, Derong, Zhang, Qichao, Zhao, Dongbin. Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2016, 46(11): 1544-1555, https://www.webofscience.com/wos/woscc/full-record/WOS:000386225800006.
[100] Tang, Yufei, He, Haibo, Ni, Zhen, Zhong, Xiangnan, Zhao, Dongbin, Xu, Xin. Fuzzy-Based Goal Representation Adaptive Dynamic Programming. IEEE TRANSACTIONS ON FUZZY SYSTEMS[J]. 2016, 24(5): 1159-1175, http://dx.doi.org/10.1109/TFUZZ.2015.2505327.
[101] Zhu, Yuanheng, Zhao, Dongbin, Li, Xiangjun. Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics. IET CONTROL THEORY AND APPLICATIONS[J]. 2016, 10(12): 1339-1347, [102] Zhu Yuanheng, Chen Xi, Zhao Dongbin, Zhang Qichao. Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations. 2016, http://ir.ia.ac.cn/handle/173211/14340.
[103] Li Dong, Xia Zhongpu, Zhao Dongbin. A Perturbed Gaussian Process Regression with Chunk Sparsification for Tracking Non-stationary Systems. PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC)null. 2016, 6639-6644, [104] Zhao Dongbin, Alippi Cesare, Bu Li. Ensemble LSDD-based change detection tests. 2016, http://ir.ia.ac.cn/handle/173211/14332.
[105] 周彤, 李栋, 朱圆恒, 王成红, 刘德荣, 王海涛, 陈亚冉, 邵坤, 赵冬斌. 深度强化学习综述:兼论计算机围棋的发展. 控制理论与应用[J]. 2016, 33(6): 701-717, [106] Zhao Dongbin, Wang Haitao, Shao Kun, Zhu Yuanheng, IEEE. Deep Reinforcement Learning with Experience Replay Based on SARSA. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI)null. 2016, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000400488300013.
[107] 孙长银, 王成红, 胡跃明, 赵东斌, 周彤, 苏剑波. “机器智能、系统优化与最优决策”专刊前言. 控制理论与应用[J]. 2016, 33(12): 1553-1554, http://lib.cqvip.com/Qikan/Article/Detail?id=7000119650.
[108] Zhao Dongbin. Model-free iterative adaptive dynamic programming solving unknown nonlinear zero-sum game based on online measurement. IEEE Transactions on Neural Networks and Learning Systems. 2016, [109] ZhuYuanheng, ShaoKun, WangHaitao, 赵冬斌. Deep reinforcement learning with Experience Replay based on SARSA. 2016, http://ir.ia.ac.cn/handle/173211/19877.
[110] Dongbin Zhao, Le Lv, Qingqiong Deng. Image clustering based on the deep sparse representations. COMPUTATIONAL INTELLIGENCE (SSCI), 2016 IEEE SYMPOSIUM SERIES ONnull. 2016, 1-6, http://ir.ia.ac.cn/handle/173211/19423.
[111] Zhao, Dongbin, Zhang, Qichao, Wang, Ding, Zhu, Yuanheng. Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2016, 46(3): 854-865, https://www.webofscience.com/wos/woscc/full-record/WOS:000370963500023.
[112] Chen, Yaran, Zhao, Dongbin, Lv, Le, Li, Chengdong, IEEE. A Visual Attention Based Convolutional Neural Network for Image Classification. PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA)null. 2016, 764-769, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000388373802067.
[113] Bu Li, Alippi Cesare, Zhao Dongbin, IEEE. Ensemble LSDD-based Change Detection Tests. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2016, 4064-4069, [114] Xia, Zhongpu, Zhao, Dongbin. Online reinforcement learning control by Bayesian inference. IET CONTROL THEORY AND APPLICATIONS[J]. 2016, 10(12): 1331-1338, https://www.webofscience.com/wos/woscc/full-record/WOS:000381410000003.
[115] Wang, Ding, Liu, Derong, Zhang, Qichao, Zhao, Dongbin. Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2016, 46(11): 1544-1555, https://www.webofscience.com/wos/woscc/full-record/WOS:000386225800006.
[116] 赵冬斌, 朱圆恒. 概率近似正确的强化学习算法解决连续状态空间控制问题. 控制理论与应用[J]. 2016, 33(12): 1603-1613, http://lib.cqvip.com/Qikan/Article/Detail?id=7000119656.
[117] Ni, Zhen, He, Haibo, Zhao, Dongbin, Xu, Xin, Prokhorov, Danil V. GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2015, 26(3): 614-627, http://dx.doi.org/10.1109/TNNLS.2014.2329942.
[118] Zhao Dongbin, ZhangQichao, Li Chengdong, Wei Qinglai. Consensus of Heterogeneous Multi-agent Systems With Switching Topologies Using Input-output Feedback Linearization. 2015 34th Chinese control conference: CCC 2015, Hangzhou, China, 28-30 July 2015, pages 6414-7296, v.8null. 2015, 6872-6877, http://ir.ia.ac.cn/handle/173211/14338.
[119] Zhang, Qichao, Zhao, Dongbin, Wei, Qinglai, Li, Chengdong, Zhao, Q, Liu, S. Consensus of Heterogeneous Multi-agent Systems With Switching Topologies Using Input-output Feedback Linearization. 2015 34TH CHINESE CONTROL CONFERENCE (CCC)null. 2015, 6872-6877, [120] Squartini, Stefano, Liu, Derong, Piazza, Francesco, Zhao, Dongbin, He, Haibo. Computational Energy Management in Smart Grids. NEUROCOMPUTING. 2015, 170: 267-269, http://dx.doi.org/10.1016/j.neucom.2015.05.110.
[121] Zhu, Yuanheng, Zhao, Dongbin, He, Haibo, Ji, Junhong. Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems. COGNITIVE COMPUTATION[J]. 2015, 7(6): 763-771, http://ir.ia.ac.cn/handle/173211/10525.
[122] 王革, 刘广天, 汪海洪, 巩可欣, 赵冬斌. 能源存储:一种新的方法. 能源存储:一种新的方法. 2015, http://ir.ia.ac.cn/handle/173211/19889.
[123] Zhao, Dongbin, Zhu, Yuanheng. MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2015, 26(2): 346-356, http://www.irgrid.ac.cn/handle/1471x/980893.
[124] Zhu, Yuanheng, Zhao, Dongbin. A data-based online reinforcement learning algorithm satisfying probably approximately correct principle. NEURAL COMPUTING & APPLICATIONS[J]. 2015, 26(4): 775-787, http://www.irgrid.ac.cn/handle/1471x/980902.
[125] 赵冬斌, Yuanheng Zhu. Model-Free Adaptive Algorithm for Optimal Control of Continuous-Time Nonlinear System. 2015, http://ir.ia.ac.cn/handle/173211/15282.
[126] Zhao Dongbin, Zhang Qichao, Li Xiangjun, Kong Lingda, Hu X, Xia Y, Zhang Y, Zhao D. Event-Triggered H-infinity Control for Continuous-Time Nonlinear System. ADVANCES IN NEURAL NETWORKS - ISNN 2015null. 2015, 9377: 62-70, [127] Xu, Xin, He, Haibo, Zhao, Dongbin, Sun, Shiliang, Busoniu, Lucian, Yang, Simon X. Machine Learning with Applications to Autonomous Systems. MATHEMATICAL PROBLEMS IN ENGINEERING[J]. 2015, 2015: http://ir.ia.ac.cn/handle/173211/10500.
[128] Li Dong, Zhao Dongbin, Zhu Yuanheng, Xia Zhongpu, IEEE. Thermal Comfort Control Based on MEC Algorithm for HVAC Systems. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2015, [129] Dongbin Zhao, 夏中谱. Online Reinforcement Learning by Bayesian Inference. PROCEEDINGS OF INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2015null. 2015, http://ir.ia.ac.cn/handle/173211/11434.
[130] 赵冬斌, ZhangZhen. Clique-based cooperative multiagent reinforcement learning using factor graphs. IEEE/CAA JOURNAL OF AUTOMATICA SINICA[J]. 2015, 3(1): 248-256, http://ir.ia.ac.cn/handle/173211/19321.
[131] Zhao, Dongbin, Xia, Zhongpu, Wang, Ding. Model-Free Optimal Control for Affine Nonlinear Systems With Convergence Analysis. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING[J]. 2015, 12(4): 1461-1468, http://dx.doi.org/10.1109/TASE.2014.2348991.
[132] 徐延才, 王澄, 赵冬斌, 刘德荣, 魏庆来. 智能小区商业模式及运营策略分析. 电力系统保护与控制[J]. 2015, 43(6): 147-154, http://lib.cqvip.com/Qikan/Article/Detail?id=663987471.
[133] Zhao Dongbin. Model-free optimal control for affine nonlinear systems based on action dependent heuristic dynamic programming with convergency analysis. IEEE Transactions on Automation and Science Engineering. 2015, [134] Zhu, Yuanheng, Zhao, Dongbin, Liu, Derong. Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems. NEUROCOMPUTING[J]. 2015, 149: 124-131, http://dx.doi.org/10.1016/j.neucom.2013.11.055.
[135] Wang Haitao, Zhao Dongbin, Li Chengdong, IEEE. Online Synchronous Policy Iteration Based on Concurrent Learning to Solve Continuous-time Optimal Control Problem. 2015 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST)null. 2015, 297-302, [136] Li Dong, Xia Zhongpu, Zhu Yuanheng, Zhao Dongbin. Thermal Comfort Control Based on MEC Algorithm for HVAC System. 2015, http://ir.ia.ac.cn/handle/173211/15667.
[137] Haitao Wang, Chengdong Li, Dongbin Zhao. Online Synchronous Policy Iteration Based on Concurrent Learning to Solve Continuous-time Optimal Control Problem. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGYnull. 2015, http://ir.ia.ac.cn/handle/173211/11385.
[138] 王澄, 刘德荣, 魏庆来, 赵冬斌, 夏振超. 带有储能设备的智能电网电能迭代自适应动态规划最优控制. 自动化学报[J]. 2014, 40(9): 1984-1990, https://d.wanfangdata.com.cn/periodical/zdhxb201409018.
[139] Zhongpu Xia, Dongbin Zhao, Huajin Tang. Model-free Adaptive Dynamic Programming for Optimal Control of Discrete-time Affine Nonlinear System. IFAC PROCEEDINGS VOLUMES. 2014, 47(3): 7049-7054, http://dx.doi.org/10.3182/20140824-6-ZA-1003.01368.
[140] Alippi, Cesare, Liu, Derong, Zhao, Dongbin, Bu, Li. Detecting and Reacting to Changes in Sensing Units: The Active Classifier Case. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2014, 44(3): 353-362, http://dx.doi.org/10.1109/TSMC.2013.2252895.
[141] Li Bu, Dongbin Zhao, Yu Liu, Qiang Guan. A hierarchical classification algorithm for evaluating energy consumption behaviors. INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2014, 1461-1466., http://www.irgrid.ac.cn/handle/1471x/973410.
[142] Zhao, Dongbin, Hu, Zhaohui, Xia, Zhongpu, Alippi, Cesare, Zhu, Yuanheng, Wang, Ding. Full-range adaptive cruise control based on supervised adaptive dynamic programming. NEUROCOMPUTING[J]. 2014, 125: 57-67, http://dx.doi.org/10.1016/j.neucom.2012.09.034.
[143] Bin Wang, Dongbin Zhao. A Kaiman filter-based actor-critic learning approach. INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2014, 3657-3662, http://ir.ia.ac.cn/handle/173211/5167.
[144] Dongbin Zhao. Event-triggered reinforcement learning approach for unknown nonlinear continuous-time system. INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2014, 3677-3784, http://ir.ia.ac.cn/handle/173211/5168.
[145] Yuanheng Zhu, Dongbin Zhao, Haibo He. An high-efficient online reinforcement learning algorithm for continuous-state systems. IEEE WORLD CONGRESSON INTELLIGENT CONTROL AND AUTOMATION (WCICA)null. 2014, 581-586., http://www.irgrid.ac.cn/handle/1471x/973405.
[146] Yuanheng Zhu, 赵冬斌. Online reinforcement learning for continuous-state systems. FRONTIERS OF INTELLIGENT CONTROL AND INFORMATION PROCESSING. 2014, http://ir.ia.ac.cn/handle/173211/15280.
[147] Wang, Bin, Zhao, Dongbin, Alippi, Cesare, Liu, Derong. Dual Heuristic dynamic Programming for nonlinear discrete-time uncertain systems with state delay. NEUROCOMPUTING[J]. 2014, 134: 222-229, http://dx.doi.org/10.1016/j.neucom.2013.06.037.
[148] Dongbin Zhao, 夏中谱. Model-free adaptive dynamic programming for optimal control of discrete-time affine nonlinear system. PROCEEDINGS OF INTERNATIONAL FEDERATION OF AUTOMATIC CONTROL 2014null. 2014, http://ir.ia.ac.cn/handle/173211/11460.
[149] Zhao Dongbin, Fan Zhijiang, Lv Le. Cheating Behavior Detection based-on Pictorial Structure Model. 2014, http://ir.ia.ac.cn/handle/173211/14473.
[150] 王澄, 魏庆来, 赵冬斌, 刘德荣, 夏振超. 基于数据的智能电网电能自适应优化调控. 控制工程[J]. 2014, 21(5): 753-759, https://d.wanfangdata.com.cn/periodical/jczdh201405029.
[151] Zhao, Dongbin, Wang, Bin, Liu, Derong. A supervised Actor-Critic approach for adaptive cruise control. SOFT COMPUTING[J]. 2013, 17(11): 2089-2099, https://www.webofscience.com/wos/woscc/full-record/WOS:000325822900011.
[152] Qian, Dianwei, Zhao, Dongbin, Yi, Jianqiang, Liu, Xiangjie. Neural sliding-mode load frequency controller design of power systems. NEURAL COMPUTING & APPLICATIONS[J]. 2013, 22(2): 279-286, http://www.irgrid.ac.cn/handle/1471x/972971.
[153] Alippi, Cesare, Bu, Li, Zhao, Dongbin, IEEE. A Prior-Free Encode-Decode Change Detection Test to Inspect Datastreams for Concept Drift. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2013, [154] 赵冬斌, Yuanheng Zhu. Online Model-Free RLSPI Algorithm for Nonlinear Discrete-Time Non-affine Systems. 2013, http://ir.ia.ac.cn/handle/173211/15281.
[155] Qian, Dianwei, Yi, Jianqiang, Zhao, Dongbin. How to automatically set an initial angle for balance control of a cart-pole system: an education case. INTERNATIONAL JOURNAL OF ELECTRICAL ENGINEERING EDUCATION[J]. 2013, 50(1): 57-68, https://www.webofscience.com/wos/woscc/full-record/WOS:000327130200005.
[156] Zhao, Dongbin, Alippi, Cesare, Liu, Derong, Zhang, Huaguang. Special issue on intelligent control and information processing. SOFT COMPUTINGnull. 2013, 17(11): 1967-1969, https://www.webofscience.com/wos/woscc/full-record/WOS:000325822900001.
[157] Ding Wang, Derong Liu, Dongbin Zhao, Yuzhu Huang, Dehua Zhang. A neural-network-based iterative GDHP approach for solving a class of nonlinear optimal control problems with control constraints. NEURAL COMPUTING AND APPLICATIONS,. 2013, 22(2): [158] Zhao, Dongbin, Shen, Yi, Wang, Zhanshan, Hu, Xiaolin. Data-based control, optimization, modeling and applications. NEURAL COMPUTING & APPLICATIONSnull. 2013, 23(7-8): 1839-1842, https://www.webofscience.com/wos/woscc/full-record/WOS:000326889800001.
[159] Cesare Alippi, Li Bu, Dongbin Zhao. A prior-free encode-decode change detection test to inspect datastreams for concept drift. INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2013, 926-931, http://www.irgrid.ac.cn/handle/1471x/973406.
[160] Wang, Ding, Liu, Derong, Zhao, Dongbin, Huang, Yuzhu, Zhang, Dehua. A neural-network-based iterative GDHP approach for solving a class of nonlinear optimal control problems with control constraints. NEURAL COMPUTING & APPLICATIONS[J]. 2013, 22(2): 219-227, https://www.webofscience.com/wos/woscc/full-record/WOS:000313657600004.
[161] Wang, Ding, Liu, Derong, Wei, Qinglai, Zhao, Dongbin, Jin, Ning. Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming. AUTOMATICA[J]. 2012, 48(8): 1825-1832, http://dx.doi.org/10.1016/j.automatica.2012.05.049.
[162] 王飞跃, 刘德荣, 熊刚, 程长建, 赵冬斌. 复杂系统的平行控制理论及应用. 复杂系统与复杂性科学[J]. 2012, 9(3): 1-12, http://lib.cqvip.com/Qikan/Article/Detail?id=43387808.
[163] Yuanheng Zhu, Dongbin Zhao, Haibo He. Integration of fuzzy controller with adaptive dynamic programming. IEEE WORLD CONGRESSON INTELLIGENT CONTROL AND AUTOMATION (WCICA)null. 2012, 310-315, http://www.irgrid.ac.cn/handle/1471x/973407.
[164] Dongbin Zhao, 夏中谱. Hybrid feedback control of vehicle longitudinal acceleration. PROCEEDING OF CHINESE CONTROL CONFERENCEnull. 2012, http://ir.ia.ac.cn/handle/173211/11437.
[165] Dongbin Zhao. Reinforcement learning control based on multi-goal representation using hierarchical heuristic dynamic programming. INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2012, 1160-1167, http://ir.ia.ac.cn/handle/173211/5173.
[166] Zhang, Huaguang, Alippi, Cesare, Zhao, Dongbin. Data-driven optimal algorithms and their applications to pattern recognition. NEUROCOMPUTINGnull. 2012, 78(1): 1-2, http://dx.doi.org/10.1016/j.neucom.2011.08.013.
[167] Zhao, Dongbin, Zhang, Zhen, Dai, Yujie. Self-teaching adaptive dynamic programming for Gomoku. NEUROCOMPUTING[J]. 2012, 78(1): 23-29, http://dx.doi.org/10.1016/j.neucom.2011.05.032.
[168] 苏永生, 赵冬斌. 基于OGRE的车辆自适应巡航控制三维仿真. 交通运输系统工程与信息[J]. 2012, 12(2): 47-52,90, http://lib.cqvip.com/Qikan/Article/Detail?id=41747301.
[169] Dongbin Zhao. Data-driven learning and control with multiple critic networks. IEEE WORLD CONGRESSON INTELLIGENT CONTROL AND AUTOMATION (WCICA)null. 2012, 523-527, http://ir.ia.ac.cn/handle/173211/5172.
[170] Zhao Dongbin, Bu Li, Alippi Cesare. SVM-based just-in-time adaptive classifiers. 2012, http://ir.ia.ac.cn/handle/173211/14330.
[171] Liu, Derong, Wang, Ding, Zhao, Dongbin, Wei, Qinglai, Jin, Ning. Neural-Network-Based Optimal Control for a Class of Unknown Discrete-Time Nonlinear Systems Using Globalized Dual Heuristic Programming. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING[J]. 2012, 9(3): 628-634, https://www.webofscience.com/wos/woscc/full-record/WOS:000309841800017.
[172] Zhao, Dongbin, Zhu, Yuanheng, He, Haibo. Neural and Fuzzy Dynamic Programming for Under-actuated Systems. INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2012, http://www.irgrid.ac.cn/handle/1471x/973390.
[173] Zhao, Dongbin, Dai, Yujie, Zhang, Zhen. Computational Intelligence in Urban Traffic Signal Control: A Survey. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWSnull. 2012, 42(4): 485-494, https://www.webofscience.com/wos/woscc/full-record/WOS:000305584800004.
[174] Zhao, Dongbin, Bai, Xuerui, Wang, FeiYue, Xu, Jing, Yu, Wensheng. DHP Method for Ramp Metering of Freeway Traffic. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS[J]. 2011, 12(4): 990-999, http://www.irgrid.ac.cn/handle/1471x/972996.
[175] Dianwei Qian, Jianqiang Yi, Dongbin Zhao. Control of Overhead Crane Systems by Combining Sliding Mode with Fuzzy Regulator. IFAC PROCEEDINGS VOLUMES. 2011, 44(1): 9320-9325, http://dx.doi.org/10.3182/20110828-6-IT-1002.01716.
[176] Wang Ding, Zhao Dongbin. Adaptive dynamic programming for optimal control of unknown nonlinear discrete-time systems. SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSC) - IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL)null. 2011, 242-249, http://ir.ia.ac.cn/handle/173211/4984.
[177] Zhaohui Hu, Dongbin Zhao. Adaptive Cruise Control Based on Reinforcement Leaning with Shaping Rewards. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS[J]. 2011, 15(3): 351-356, [178] 谭湘敏, 易建强, 赵冬斌, 郝银星. 高超声速飞行器轨迹跟踪控制仿真研究. 系统仿真学报[J]. 2011, 23(4): 745-749, http://lib.cqvip.com/Qikan/Article/Detail?id=37228093.
[179] Zhao Dongbin, Zhu Fenghua. Neural network based online traffic signal controller design with reinforcement training. IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC)null. 2011, 1045-1050, http://ir.ia.ac.cn/handle/173211/5017.
[180] Chai, Tianyou, Hou, Zhongsheng, Lewis, Frank L, Hussain, Amir, Zhao, Dongbin. Special Section on Data-Based Control, Modeling, and Optimization. IEEE TRANSACTIONS ON NEURAL NETWORKS. 2011, 22(12): 2150-2153, http://ir.ia.ac.cn/handle/173211/9737.
[181] Zhao Dongbin. Supervised adaptive dynamic programming based adaptive cruise control. SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSC) - IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL)null. 2011, 318-323, http://ir.ia.ac.cn/handle/173211/5035.
[182] Zhao Dongbin. DHP for coordinated freeway ramp metering. IEEE Transactions on Intelligent Transportation Systems. 2011, [183] Zhao Dongbin. Reinforcement learning for multi-agent patrol policy. IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS (ICCI)null. 2010, 530-535, http://ir.ia.ac.cn/handle/173211/5030.
[184] Li Chengdong, Yi Jianqiang, Yu Yi, Zhao Dongbin. Inverse Control of Cable-driven Parallel Mechanism Using Type-2 Fuzzy Neural Network. 自动化学报[J]. 2010, 459-464, http://lib.cqvip.com/Qikan/Article/Detail?id=33164216.
[185] Zhao Dongbin. A traffic signal control algorithm for isolated intersections based on adaptive dynamic programming. INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC)null. 2010, 255-260, http://ir.ia.ac.cn/handle/173211/4980.
[186] Zhao DongBin, Yi JianQiang. Inverse control of cable-driven parallel mechanism using type-2 fuzzy neural network. ZIDONGHUA XUEBAO/ ACTA AUTOMATICA SINICA[J]. 2010, 36(3): 459-464, http://ir.ia.ac.cn/handle/173211/4133.
[187] Xu, Dong, Zhao, Dongbin, Yi, Jianqiang, Tan, Xiangmin. Trajectory Tracking Control of Omnidirectional Wheeled Mobile Manipulators: Robust Neural Network-Based Sliding Mode Approach. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS[J]. 2009, 39(3): 788-799, http://www.irgrid.ac.cn/handle/1471x/973030.
[188] Zhao Dongbin. Coordinated control of multiple ramps metering based on ADHDP (λ) Controller. International Journal of Innovative Computing, Information and Control. 2009, [189] Li, Chengdong, Yi, Jianqiang, Zhao, Dongbin. Design of interval type-2 fuzzy logic system using sampled data and priorknowledge. ICIC EXPRESS LETTERS[J]. 2009, 3(3): 695-700, http://www.irgrid.ac.cn/handle/1471x/972962.
[190] Bai, Xuerui, Zhao, Dongbin, Yi, Jianqiang. THE APPLICATION OF ADHDP(lambda) METHOD TO COORDINATED MULTIPLE RAMPS METERING. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL[J]. 2009, 5(10B): 3471-3481, http://www.irgrid.ac.cn/handle/1471x/973026.
[191] Liu, Weirong, Yi, Jianqiang, Zhao, Dongbin. GENETIC ALGORITHM-BASED FUZZY CONTROLLER TO AVOID NETWORK CONGESTION. INTELLIGENT AUTOMATION AND SOFT COMPUTING[J]. 2009, 15(2): 237-248, http://www.irgrid.ac.cn/handle/1471x/973003.
[192] 余意, 易建强, 李成栋, 赵冬斌, 张建宏. 绳索牵引自动水平调节机器人综合控制策略. 电机与控制学报[J]. 2009, 13(6): 891-896, http://lib.cqvip.com/Qikan/Article/Detail?id=32476928.
[193] Zhao DongBin, Yi JianQiang. An overview on the adaptive dynamic programming based urban city traffic signal optimal control. ZIDONGHUA XUEBAO/ ACTA AUTOMATICA SINICA[J]. 2009, 35(6): 676-681, http://ir.ia.ac.cn/handle/173211/3823.
[194] 徐冬, 赵冬斌, 易建强, 谭湘敏. 全方位移动机械手运动控制Ⅱ——鲁棒控制. 机械工程学报[J]. 2009, 45(1): 42-49, http://lib.cqvip.com/Qikan/Article/Detail?id=29284695.
[195] 赵冬斌, 刘德荣, 易建强. 基于自适应动态规划的城市交通信号优化控制方法综述. 自动化学报[J]. 2009, 676-681, http://lib.cqvip.com/Qikan/Article/Detail?id=30708441.
[196] 谭湘敏, 赵冬斌, 易建强, 徐冬. 全方位移动机械手运动控制Ⅰ——建模与控制. 机械工程学报[J]. 2009, 45(1): 35-41, http://lib.cqvip.com/Qikan/Article/Detail?id=29284694.
[197] Zhao Dongbin, Tan Xiangmin, Yi Jianqiang. Motion control of omnidirectional mobile manipulators (Part II) - Robust control. JIXIE GONGCHENG XUEBAO/JOURNAL OF MECHANICAL ENGINEERING[J]. 2009, 45(1): 42-49, http://ir.ia.ac.cn/handle/173211/3530.
[198] Zhao, Dongbin, Deng, Xuyue, Yi, Jianqiang. Motion and Internal Force Control for Omnidirectional Wheeled Mobile Robots. IEEE-ASME TRANSACTIONS ON MECHATRONICS[J]. 2009, 14(3): 382-387, http://www.irgrid.ac.cn/handle/1471x/973009.
[199] Yi Jianqiang, Tan Xiangmin, Zhao Dongbin. Motion control of omnidirectional mobile manipulators (Part I) - Modeling and control. JIXIE GONGCHENG XUEBAO/JOURNAL OF MECHANICAL ENGINEERING[J]. 2009, 45(1): 35-41, http://ir.ia.ac.cn/handle/173211/3529.
[200] Li Tao, Zhao Dongbin, Yi Jianqiang, IEEE. Adaptive Dynamic Neuro-fuzzy System for Traffic Signal Control. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8null. 2008, 1840-1846, [201] 余意, 易建强, 赵冬斌, 张建宏. 绳索牵引自动水平调节机器人的设计. 华中科技大学学报:自然科学版[J]. 2008, http://lib.cqvip.com/Qikan/Article/Detail?id=1000600790.
[202] Zhang, Nan, Wang, FeiYue, Zhu, Fenghua, Zhao, Dongbin, Tang, Shuming. DynaCAS: Computational Experiments and Decision Support for ITS. IEEE INTELLIGENT SYSTEMS[J]. 2008, 23(6): 19-23, http://dx.doi.org/10.1109/MIS.2008.101.
[203] 李成栋, 易建强, 赵冬斌. 一种新的针对平移振荡器系统的模糊控制方法. 重庆工学院学报:自然科学版[J]. 2008, 22(9): 47-51, http://lib.cqvip.com/Qikan/Article/Detail?id=28377890.
[204] Qian, Dianwei, Yi, Jianqiang, Zhao, Dongbin. Hierarchical sliding mode control for a class of SIMO under-actuated systems. CONTROL AND CYBERNETICS[J]. 2008, 37(1): 159-175, http://www.irgrid.ac.cn/handle/1471x/1006044.
[205] Zhao Dongbin, Yi Jianqiang, Deng Xuyue, IEEE. Motion regulation of redundantly actuated omni-directional wheeled mobile robots with internal force control. 2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9null. 2007, 3925-3930, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000254073202166.
[206] Hong, Yiping, Yi, Jianqiang, Zhao, Dongbin. Improved mean shift segmentation approach for natural images. APPLIED MATHEMATICS AND COMPUTATION[J]. 2007, 185(2): 940-952, http://dx.doi.org/10.1016/j.amc.2006.07.038.
[207] Sun, Zhanquan, Xi, Guangcheng, Yi, Jianqiang, Zhao, Dongbin. Select informative symptoms combination for diagnosing syndrome. JOURNAL OF BIOLOGICAL SYSTEMS[J]. 2007, 15(1): 27-37, http://www.irgrid.ac.cn/handle/1471x/1005876.
[208] Li Tao, Zhao Dongbin, Yi Jianqiang, Liu D, Fei S, Hou ZG, Zhang HG, Sun CY. Application of ADP to intersection signal control. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGSnull. 2007, 4491: 374-+, [209] 李新春, 赵冬斌, 易建强. 一种末端任务给定的移动机械手动态路径规划方法. 控制与决策[J]. 2007, 184-188,194, http://lib.cqvip.com/Qikan/Article/Detail?id=1000022854.
[210] 李佳宁, 易建强, 赵冬斌, 西广成. 一种新的基于神经模糊推理网络的复杂系统模糊辨识方法. 自动化学报[J]. 2006, 32(5): 695-703, http://lib.cqvip.com/Qikan/Article/Detail?id=22739716.
[211] 李新春, 赵冬斌, 易建强. 一种全方位移动机械手的可操作度分析. 中国机械工程[J]. 2006, 17(14): 1442-1447, http://lib.cqvip.com/Qikan/Article/Detail?id=22336893.
[212] 李新征, 易建强, 赵冬斌, 洪义平. 一种改进的自然图像分割方法. 计算机应用研究[J]. 2006, 23(5): 251-254, http://lib.cqvip.com/Qikan/Article/Detail?id=21720653.
[213] Zhao, Dongbin, Yi, Jianqiang, Wang, J, Yi, Z, Zurada, JM, Lu, BL, Yin, H. A particle swarm optimized fuzzy neural network control for acrobot. ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 2, PROCEEDINGS[J]. 2006, 3972(?): 1160-1165, http://www.irgrid.ac.cn/handle/1471x/1005784.
[214] Zhao, DongBin, Yi, JianQiang, Chen, H, Wang, FY, Yang, CC, Zeng, D, Chau, M, Chang, K. Analysis of infectious disease data based on evolutionary computation. INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS[J]. 2006, 3917: 179-180, http://www.irgrid.ac.cn/handle/1471x/1005763.
[215] 王伟, 易建强, 赵冬斌, 柳晓菁. 一类非确定欠驱动系统的串级模糊滑模控制. 控制理论与应用[J]. 2006, 23(1): 53-59, http://lib.cqvip.com/Qikan/Article/Detail?id=21307327.
[216] 李新春, 赵冬斌, 易建强. 基于OpenGL的移动机械手路径规划仿真. 系统仿真学报[J]. 2006, 18(1): 196-199, http://lib.cqvip.com/Qikan/Article/Detail?id=21071848.
[217] Zhao, Dongbin, Yi, Jianqiang, Jiao, L, Wang, L, Gao, X, Liu, J, Wu, F. Robot planning with artificial potential field guided ant colony optimization algorithm. ADVANCES IN NATURAL COMPUTATION, PT 2[J]. 2006, 4222: 222-231, http://ir.ia.ac.cn/handle/173211/9227.
[218] 洪义平, 易建强, 赵冬斌, 李新征. 自然图像分割方法及其在目标检测中的应用. 模式识别与人工智能[J]. 2006, 19(5): 658-662, https://d.wanfangdata.com.cn/periodical/mssbyrgzn200605017.
[219] Deng, Xuyue, Zhao, Dongbin, Yi, Jianqiang, Wen, John T, IEEE. Motion and squeeze force control for omnidirectional wheeled mobile robots. 2006 AMERICAN CONTROL CONFERENCE, VOLS 1-12null. 2006, 1-12: COVER4-COVER5, [220] Zhong Zhiguang, Yi Jianqiang, Zhao Dongbin. A Robust Two Feature Points Based Depth Estimation Method. ACTA AUTOMATICA SINICA[J]. 2005, 31(5): 693-698, http://sciencechina.cn/gw.jsp?action=detail.jsp&internal_id=2085065&detailType=1.
[221] 赵冬斌, 杨晟刚, 易建强, 张彦. 煤气化炉的仿真系统开发. 系统仿真学报[J]. 2005, 17(5): 1258-1260, http://lib.cqvip.com/Qikan/Article/Detail?id=15645163.
[222] Hong Yiping, Yi Jianqiang, Zhao Dongbin. 门牌识别系统中的鲁棒性分割方法. 自动化学报[J]. 2005, 31(3): 359-364, http://lib.cqvip.com/Qikan/Article/Detail?id=15964393.
[223] 王伟, 易建强, 赵冬斌, 柳晓菁. 基于稳定性分析的一类欠驱动系统的滑模控制器设计. 信息与控制[J]. 2005, 34(2): 232-235, http://lib.cqvip.com/Qikan/Article/Detail?id=15407684.
[224] 王伟, 易建强, 赵冬斌, 柳晓菁. 一种新型神经网络滑模控制器的设计. 电机与控制学报[J]. 2005, 9(6): 603-606, http://lib.cqvip.com/Qikan/Article/Detail?id=20632490.
[225] 钟志光, 易建强, 赵冬斌. 一种鲁棒的基于两个特征点的深度估计方法. 自动化学报[J]. 2005, 31(5): 693-698, http://lib.cqvip.com/Qikan/Article/Detail?id=20220950.
[226] 程玉虎, 易建强, 王雪松, 赵冬斌. 一类连续状态与动作空间下的加权Q学习. 电机与控制学报[J]. 2005, 9(6): 570-574, http://lib.cqvip.com/Qikan/Article/Detail?id=20632482.
[227] 宋佐时, 易建强, 赵冬斌, 李新春. 基于神经网络的一类非线性系统自适应滑模控制. 电机与控制学报[J]. 2005, 9(5): 481-485,490, http://lib.cqvip.com/Qikan/Article/Detail?id=20217003.
[228] 李新征, 易建强, 赵冬斌. 基于视觉的机器人定位精度提高方法. 计算机测量与控制[J]. 2005, 13(6): 545-547, http://lib.cqvip.com/Qikan/Article/Detail?id=15891384.
[229] 钟志光, 易建强, 赵冬斌, 洪义平, 李新征. 一种鲁棒的只需两帧图像的姿态估计方法. 模式识别与人工智能[J]. 2005, 18(4): 424-428, http://lib.cqvip.com/Qikan/Article/Detail?id=20045319.
[230] 钟志光, 易建强, 赵冬斌. 一种基于点对的相机几何标定方法. 机器人[J]. 2005, 27(1): 31-34,40, http://lib.cqvip.com/Qikan/Article/Detail?id=11392100.
[231] 王伟, 易建强, 赵冬斌, 刘殿通. Pendubot的一种分层滑模控制方法. 控制理论与应用[J]. 2005, 22(3): 417-422, http://lib.cqvip.com/Qikan/Article/Detail?id=15893765.
[232] 李新春, 赵冬斌, 易建强, 宋佐时. 一种移动机械手分级协调路径规划方法. 制造业自动化[J]. 2005, 27(5): 28-32, http://lib.cqvip.com/Qikan/Article/Detail?id=15702372.
[233] 易建强, 赵冬斌, 张耀华, 洪义平. 实时卡片字符识别与校验系统的设计与实现. 计算机工程与应用[J]. 2005, 41(17): 113-115, http://lib.cqvip.com/Qikan/Article/Detail?id=15759864.
[234] 柳晓菁, 易建强, 赵冬斌, 王伟. 基于最小二乘支持向量机的自适应逆扰动消除控制系统. 控制与决策[J]. 2005, 20(8): 947-950, http://lib.cqvip.com/Qikan/Article/Detail?id=18080586.
[235] 钟志光, 易建强, 赵冬斌. 一种基于点对的深度和运动估计方法. 机器人[J]. 2005, 27(2): 137-141, http://lib.cqvip.com/Qikan/Article/Detail?id=15237907.
[236] 邓旭玥, 易建强, 赵冬斌. 一种全方位移动机器人的控制方法. 电机与控制学报[J]. 2005, 9(2): 139-144, http://lib.cqvip.com/Qikan/Article/Detail?id=15171263.
[237] 张文增, 赵冬斌, 陈强, 都东. 基于模型的人体运动参数检测. 生物医学工程学杂志[J]. 2005, 22(1): 147-150, http://lib.cqvip.com/Qikan/Article/Detail?id=20330430.
[238] 李佳宁, 易建强, 赵冬斌, 西广成. 一种基于强化学习的在线神经模糊控制系统. 中国科学院研究生院学报[J]. 2005, 22(5): 631-638, http://lib.cqvip.com/Qikan/Article/Detail?id=20175987.
[239] 柳晓菁, 易建强, 赵冬斌. 基于Lyapunov稳定理论设计MRAC系统的简单方法. 系统仿真学报[J]. 2005, 17(8): 1933-1935, http://lib.cqvip.com/Qikan/Article/Detail?id=18004742.
[240] 李新春, 赵冬斌, 易建强, 宋佐时. 移动机械手结构设计. 可编程控制器与工厂自动化(PLC FA)[J]. 2004, 103-106, http://lib.cqvip.com/Qikan/Article/Detail?id=11498378.
[241] 王伟, 易建强, 赵冬斌, 刘殿通. 桥式吊车系统的分级滑模控制方法. 自动化学报[J]. 2004, 30(5): 784-788, http://lib.cqvip.com/Qikan/Article/Detail?id=10878442.
[242] 钟志光, 易建强, 赵冬斌, 洪义平. 基于运动视的移动机器人定位方法. 机器人[J]. 2004, 26(4): 325-329, http://lib.cqvip.com/Qikan/Article/Detail?id=10340405.
[243] 程玉虎, 易建强, 赵冬斌. 机器人行为协调机制研究进展. 机器人[J]. 2004, 26(2): 187-192, http://lib.cqvip.com/Qikan/Article/Detail?id=9211465.
[244] 王骞, 易建强, 赵冬斌, 吴并臻. 一类网络控制系统的建模及分析. 控制工程[J]. 2004, 11(6): 529-532, http://lib.cqvip.com/Qikan/Article/Detail?id=11084214.
[245] 宋佐时, 易建强, 赵冬斌, 李新春. 基于MATLAB的非完整动力学系统跟踪控制的动态仿真. 系统仿真学报[J]. 2004, 16(7): 1433-1436, http://lib.cqvip.com/Qikan/Article/Detail?id=10343658.
[246] 邓旭玥, 易建强, 赵冬斌. 一种全方位移动机器人的运动学分析. 机器人[J]. 2004, 26(1): 49-53, http://lib.cqvip.com/Qikan/Article/Detail?id=8797014.
[247] 柳晓菁, 易建强, 赵冬斌, 王伟. 一种基于RBF网络的非线性自适应逆控制系统. 控制与决策[J]. 2004, 19(10): 1175-1177,1182, http://lib.cqvip.com/Qikan/Article/Detail?id=10659975.
[248] 易建强, 郑耀林, 赵冬斌, 王伟. DC—DC变换器的模糊神经网络控制方法研究. 系统仿真学报[J]. 2004, 16(11): 2567-2570,2574, http://lib.cqvip.com/Qikan/Article/Detail?id=10886005.
[249] 张文增, 陈强, 孙振国, 赵冬斌. 具有形状自适应的欠驱动拟人机器人手指. 机械工程学报[J]. 2004, 40(10): 115-118, http://lib.cqvip.com/Qikan/Article/Detail?id=10852131.
[250] 王伟, 易建强, 赵冬斌, 刘殿通. 基于滑模方法的桥式吊车系统的抗摆控制. 控制与决策[J]. 2004, 19(9): 1013-1016,1021, http://lib.cqvip.com/Qikan/Article/Detail?id=10476198.
[251] 李佳宁, 易建强, 赵冬斌, 邓旭玥, 西广成. 一种全方位移动机械手的体系结构设计与分析. 机器人[J]. 2004, 26(3): 272-276, http://lib.cqvip.com/Qikan/Article/Detail?id=9871639.
[252] 邓旭玥, 王伟, 易建强, 赵冬斌. 移动机器人导航研究现状及其发展趋势展望. 可编程控制器与工厂自动化(PLC FA)[J]. 2004, 4-8, http://lib.cqvip.com/Qikan/Article/Detail?id=11498326.
[253] 钟志光, 易建强, 赵冬斌, 洪义平. 一种基于目标识别的运动视定位方法. 模式识别与人工智能[J]. 2004, 17(3): 337-341, http://lib.cqvip.com/Qikan/Article/Detail?id=10656789.
[254] 曹一鹏, 张文增, 赵冬斌. 拟人机器人上肢运动检测系统的研制. 传感器技术[J]. 2003, 22(6): 14-16, http://lib.cqvip.com/Qikan/Article/Detail?id=8005642.
[255] 赵冬斌, 易建强, 邓旭玥. 全方位移动机器人结构和运动分析. 机器人[J]. 2003, 25(5): 394-398, http://lib.cqvip.com/Qikan/Article/Detail?id=8293612.
[256] 张文增, 陈强, 孙振国, 徐济民, 赵冬斌. 变抓取力的欠驱动拟人机器人手. 清华大学学报:自然科学版[J]. 2003, 43(8): 1143-1147, http://lib.cqvip.com/Qikan/Article/Detail?id=8288527.
[257] 陈善本, 赵冬斌, 娄亚军, 吴林. 脉冲GTAW焊缝成形智能控制方法. 自动化学报[J]. 2003, 29(1): 130-134, http://lib.cqvip.com/Qikan/Article/Detail?id=7315471.
[258] 宋佐时, 易建强, 赵冬斌. 移动机械手控制研究进展. 机器人[J]. 2003, 25(5): 465-469,480, http://lib.cqvip.com/Qikan/Article/Detail?id=8293652.
[259] 赵冬斌, 易建强, 张文增, 陈强, 都东. 拟人机器人TH-1手臂运动学. 机器人[J]. 2002, 24(6): 502-507, http://lib.cqvip.com/Qikan/Article/Detail?id=11420678.
[260] Yi Lu, Yaran Chen, Dongbin Zhao, Dong Li. GMRL: Graph neural network based inference in a Markov network with Reinforcement Learning for visual navigation. NEUROCOMPUTING. http://dx.doi.org/10.1016/j.neucom.2020.07.091.
发表著作
(1) 全方位移动机器人导论, An introduction to Omnidirectinoal Mobile Robots, 科学出版社, 2010-05, 第 1 作者
(2) 机器人手册,第26章-面向操作任务的运动, Springer Handbook of Robotics, Chapter 26 - Motion for Manipulation Tasks, 机械工业出版社, 2013-01, 第 1 作者
(3) 机器人手册,第51章-智能车辆, Springer Handbook of Robotics, Chapter 51 - Intelligent Vehicles, 机械工业出版社, 2013-01, 第 1 作者
(4) Advances in Brain Inspired Cognitive Systems, Advances in Brain Inspired Cognitive Systems, Springer Heidelberg Dordrecht London NewYork, 2013-06, 第 3 作者
(5) Frontiers of Intelligent Control and Information Processing, Frontiers of Intelligent Control and Information Processing, World Scientific Publishing, 2014-11, 第 3 作者
(6) Advances in Neural Networks – ISNN 2015, Springer Heidelberg Dordrecht London NewYork, 2015-04, 第 4 作者
(7) Neural Information Processing, Lecture Notes in Computer Science 10636, 10637, 10638, 10639, Springer Heidelberg Dordrecht London NewYork, 2017-10, 第 4 作者

科研活动

   
科研项目
( 1 ) 基于数据的非线性控制系统分析与设计, 参与, 国家级, 2011-01--2014-12
( 2 ) 汽车的智慧起停巡航控制, 主持, 省级, 2012-01--2014-12
( 3 ) 基于监督式ADP 的汽车智能巡航控制, 主持, 国家级, 2013-01--2016-12
( 4 ) 能源管控中心平行控制节能技术研究, 参与, 省级, 2013-04--2014-12
( 5 ) 建筑能耗数据挖掘与分析工具包开发, 参与, 省级, 2013-12--2014-12
( 6 ) 汽车自适应巡航控制(ACC)系统及方法, 主持, 省级, 2013-09--2016-05
( 7 ) 人机交互的监督强化学习控制理论和方法, 主持, 研究所(学校), 2015-01--2016-12
( 8 ) 深度自适应动态规划理论方法和应用, 主持, 国家级, 2016-01--2019-12
( 9 ) 基于数据的建筑群及分布式能源系统一体化建模与自学习优化控制, 参与, 国家级, 2016-01--2020-12
( 10 ) 中国科学院海外评审专家(何海波), 主持, 部委级, 2015-01--2016-12
( 11 ) 智能辅助驾驶控制系统关键技术研究与产品开发, 主持, 国家级, 2016-07--2019-06
( 12 ) 不完全信息动态博弈的优化决策, 主持, 国家级, 2017-03--2018-12
( 13 ) 深度神经网络优化的群体协作神经动力学方法, 主持, 部委级, 2018-01--2020-12
( 14 ) 智能驾驶危险目标检测的深度强化学习方法, 主持, 省级, 2018-01--2019-12
( 15 ) 高度自动驾驶(L4级)电动汽车关键技术研发及验证平台开发--深度强化学习应用, 主持, 省级, 2018-01--2019-12
( 16 ) “气虚证辨证标准的系统研究”的中医证候辨证新方法研究--中医AI, 主持, 国家级, 2018-01--2020-12
( 17 ) 基于人工智能的智能驾驶体验科普展品, 主持, 省级, 2018-01--2018-12
( 18 ) 强化学习技术和硬件化技术研究, 主持, 院级, 2018-09--2019-06
( 19 ) 面向智能驾驶的深度强化学习方法研究, 主持, 院级, 2018-09--2019-08
( 20 ) 面向地铁运营场景需求的智能感知核心技术研究, 主持, 院级, 2018-09--2019-08
( 21 ) 非完全信息条件下的博弈决策--知识与数据共同驱动的深度强化学习算法, 主持, 国家级, 2020-01--2022-12
( 22 ) 复杂城市交互场景下的电动汽车智能决策技术, 主持, 省级, 2019-07--2020-06
( 23 ) 适配硬件的算子结构优化及自动并行切分技术研究, 主持, 院级, 2019-08--2020-05
参与会议
(1)Deep Reinforcement Learning for Video Game   华为多智能体强化学习研讨会   2019-04-25
(2)深度强化学习算法与医疗应用   中国中医药信息研究会临床研究分会第三届学术年会   2018-09-08
(3)深度强化学习算法与应用   中国自动化学会“深度与宽度强化学习”前沿论坛   2018-05-30
(4)Game AI with RL and DL   2018-05-21
(5)深度强化学习进展:从AlphaGo到AlphaGo Zero   第二届世界智能大会   2018-05-17
(6)Game AIs with RL and DL   2018-05-16
(7)Recent Progress on Deep Reinforcement Learning-- from AlphaGo to AlphaGo Zero   三星机器学习前沿研讨会   2018-01-15
(8)深度强化学习算法及应用   中国电力科学研究院2017年二 零八科学会议—人工智能在电力领域的研究应用方向和关键技术   2017-12-06
(9)Cooperative reinforcement learning for multiple units combat in StarCraft   Kun Shao, Yuanheng Zhu, Dongbin Zhao   2017-11-28
(10)Event-triggered integral reinforcement learning for nonlinear continuous-time systems   Qichao Zhang, Dongbin Zhao   2017-11-28
(11)深度强化学习进展—从AlphaGo到AlphaGo Zero   中国仿真学会智能物联专委会会议   2017-11-17
(12)Off-Policy reinforcement learning for partially unknown nonzero-sum games   2017-11-16
(13)FMR-GA -- A cooperative multi-agent reinformcement learning algorithm based on gradient ascent   2017-11-16
(14)人工智能方法及其在智慧城市中的应用   泰山科技论坛—人工智能在智慧城市建设中的应用研究   2017-11-08
(15)A Kolmogorov-Smirnov test to detect changes in stationarity in big data   2017-07-06
(16)Multi-task learning with Cartesian product-based multi-objective combination for dangerous object detection   2017-06-10
(17)Data-driven adaptive dynamic programming for two-player nonzero-sum game   2017-05-29
(18)Comparison of methods to efficient graph SLAM under general optimization framework   2017-05-19
(19)Policy gradient methods with gaussian process modelling acceleration   2017-05-16

指导学生

已指导学生

田艺  硕士研究生  081101-控制理论与控制工程  

胡朝辉  硕士研究生  081101-控制理论与控制工程  

戴钰桀  博士研究生  081101-控制理论与控制工程  

苏永生  硕士研究生  081101-控制理论与控制工程  

张震  博士研究生  081101-控制理论与控制工程  

王滨  博士研究生  081101-控制理论与控制工程  

朱圆恒  博士研究生  081101-控制理论与控制工程  

王海涛  硕士研究生  081101-控制理论与控制工程  

夏中谱  博士研究生  081101-控制理论与控制工程  

张启超  博士研究生  081101-控制理论与控制工程  

吕乐  博士研究生  081101-控制理论与控制工程  

卜丽  博士研究生  081101-控制理论与控制工程  

陈亚冉  博士研究生  081101-控制理论与控制工程  

邵坤  博士研究生  081101-控制理论与控制工程  

李栋  博士研究生  081101-控制理论与控制工程  

现指导学生

唐振韬  博士研究生  081101-控制理论与控制工程  

卢毅  博士研究生  081101-控制理论与控制工程  

李浩然  博士研究生  081101-控制理论与控制工程  

刘民颂  硕士研究生  081101-控制理论与控制工程  

丁子祥  博士研究生  081203-计算机应用技术  

刘育琦  博士研究生  081101-控制理论与控制工程  

李伟凡  博士研究生  081104-模式识别与智能系统  

胡光政  博士研究生  081203-计算机应用技术  

李楠楠  博士研究生  081101-控制理论与控制工程  

王俊杰  博士研究生  081101-控制理论与控制工程