Journal of Sensors

Research Article

Visual Interaction Force Estimation Based on Time-Sensitive Dual-Resolution Learning Network

Comparison results with state-of-the-art spatiotemporal methods.


Method	RMSE	MAE	MSE	R2	Inference time (s)

C3D-Desnet3D (, 400000 images)	0.1541	0.0929	0.0237	0.5797
T3D-EfficientnetB0 (, 400000 images)	0.1652	0.0768	0.0273	0.4491
T3D-Resnet50 (, 400000 images)	0.1225	0.0652	0.0150	0.7926
Our method (, 400000 images)	0.0397	0.0243	0.0015	0.9725
Simplified version of our method (, 400000 images)	0.0313	0.0183	0.0009	0.9833