Adaptive cross-fusion learning for multi-modal gesture recognition
Background: Gesture recognition has attracted significant attention because of its wide range of potential applications. Although multi-modal gesture recognition has made significant progress in recent years, a popular approach is still to simply fuse the prediction scores at the end of each branch, which often ignores complementary features