Audio-Visual
Audio-Visual Processing
- An audio-visual processing system is a cross-modal system that combines speech (audio) and video information.
- The system extracts a representation shared by the audio and visual signals, then uses it for downstream tasks.
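
The idea of a shared representation can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not a description of any particular system: the feature sizes (`AUDIO_DIM`, `VIDEO_DIM`, `SHARED_DIM`), the random untrained projection matrices, and the helper names are all hypothetical. A real system would learn the projections from data, typically with neural encoders.

```python
import math
import random


def project(features, weights):
    """Linear projection: one output value per weight row."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]


def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else list(vec)


def random_weights(out_dim, in_dim, rng):
    """Stand-in for learned parameters: random values in [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)] for _ in range(out_dim)]


def embed(features, weights):
    """Map modality-specific features into the shared space."""
    return l2_normalize(project(features, weights))


def cosine_similarity(a, b):
    """Dot product; equals cosine similarity because inputs are unit length."""
    return sum(x * y for x, y in zip(a, b))


# Hypothetical sizes: per-frame audio features, per-frame visual features,
# and the dimension of the shared space.
AUDIO_DIM, VIDEO_DIM, SHARED_DIM = 40, 128, 16

rng = random.Random(0)  # fixed seed so the sketch is reproducible
W_audio = random_weights(SHARED_DIM, AUDIO_DIM, rng)
W_video = random_weights(SHARED_DIM, VIDEO_DIM, rng)

# Placeholder inputs standing in for real extracted features.
audio_features = [rng.random() for _ in range(AUDIO_DIM)]
video_features = [rng.random() for _ in range(VIDEO_DIM)]

# Both modalities now live in the same SHARED_DIM-dimensional space.
audio_emb = embed(audio_features, W_audio)
video_emb = embed(video_features, W_video)

# Two simple downstream operations on the shared representation:
similarity = cosine_similarity(audio_emb, video_emb)  # cross-modal matching score
fused = audio_emb + video_emb  # list concatenation as a basic fusion step
```

Because the two embeddings have the same dimension, they can be compared directly (the similarity score) or combined (the fused vector) before being passed to a downstream model. With untrained random weights the score itself carries no meaning; only the data flow is the point.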