| Input: User_speeches, User_body_gestures, User_hand_gestures, final_task, M(i, subtask), M(subtask, motion) |
| Initialize: NLP module, Sub_classifier(User_speeches, User_body_gestures, User_hand_gestures), memory M, episode ← 0, network parameters θ, replace_iter |
| Output: Motion_robot |
| While final_task is not finished do: |
| s ← Sub_classifier(User_speeches, User_body_gestures, User_hand_gestures) |
| With probability ε, select a random intention i |
| Otherwise, use equation (1) to calculate i |
| subtask ← M(i, subtask) |
| Motion ← M(subtask, motion) |
| Motion_robot ← Motion − Motion_user |
| r ← NLP(feedback_speech) |
| // s′ is the next behavior feature of the user after the robot executes Motion_robot |
| s′ ← Sub_classifier(User_speeches, User_body_gestures, User_hand_gestures) after the robot executes Motion_robot |
| Calculate reward r_t according to equation (2) |
| Store (s, i, r, s′) in M |
| batch_memory ← random sample from M |
| If s′ marks the end of the collaboration: |
| y′ ← r |
| Else: |
| Use equation (3) to calculate y′ |
| Use equation (4) to calculate the loss |
| Minimize the loss with respect to θ |
| If episode > replace_iter: |
| θ⁻ ← θ |
| End |
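The loop above can be sketched as a standard DQN-style training procedure. This is a minimal, self-contained illustration, not the paper's implementation: the Sub_classifier, the NLP feedback module, the M(i, subtask) and M(subtask, motion) mappings, and equations (1)–(4) are replaced by simple stand-ins (a random feature vector for s, a fixed reward rule, a linear Q-function), so only the control flow — ε-greedy intention selection, replay memory, TD target y′, and the periodic θ⁻ ← θ swap — mirrors the pseudocode.

```python
import random
from collections import deque

import numpy as np

random.seed(0)
np.random.seed(0)

# Hypothetical sizes: behavior-feature dimension and number of intentions.
STATE_DIM, N_INTENTIONS = 4, 3
GAMMA, EPSILON, REPLACE_ITER, BATCH = 0.9, 0.1, 50, 32

# Linear Q-function stand-in for the paper's network: Q(s, .) = s @ theta.
theta = np.zeros((STATE_DIM, N_INTENTIONS))   # online parameters (theta)
theta_target = theta.copy()                   # target parameters (theta minus)
memory = deque(maxlen=1000)                   # replay memory M

def select_intention(s, eps=EPSILON):
    """epsilon-greedy: random intention with probability eps, else argmax Q (stand-in for eq. (1))."""
    if random.random() < eps:
        return random.randrange(N_INTENTIONS)
    return int(np.argmax(s @ theta))

def train_step(lr=0.01):
    """Sample a batch from M and take TD updates (stand-ins for eqs. (3) and (4))."""
    batch = random.sample(list(memory), min(BATCH, len(memory)))
    for s, i, r, s_next, done in batch:
        # y' = r for terminal transitions, else r + gamma * max_a Q_target(s', a)
        y = r if done else r + GAMMA * np.max(s_next @ theta_target)
        td_error = y - (s @ theta)[i]         # squared-TD-error loss; gradient step below
        theta[:, i] += lr * td_error * s

for episode in range(200):
    s = np.random.rand(STATE_DIM)             # stand-in for Sub_classifier output
    for _ in range(10):
        i = select_intention(s)
        s_next = np.random.rand(STATE_DIM)    # stand-in for the next behavior feature s'
        r = 1.0 if i == 0 else 0.0            # stand-in reward (eq. (2)): intention 0 is "correct"
        done = random.random() < 0.1          # stand-in end-of-collaboration signal
        memory.append((s, i, r, s_next, done))
        train_step()
        if done:
            break
        s = s_next
    if episode > REPLACE_ITER and episode % REPLACE_ITER == 0:
        theta_target = theta.copy()           # theta_minus <- theta
```

Because intention 0 is the only rewarded choice in this toy environment, the learned Q-values come to prefer it, which is the behavior the ε-greedy selection and target-network swap are meant to produce.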