UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022
Zhang Yuanhang ; Liang Susan ; Yang Shuang ; Shan Shiguang

Abstract
This report presents a brief description of our winning solution to the AVA Active Speaker Detection (ASD) task at ActivityNet Challenge 2022. Our underlying model UniCon+ continues to build on our previous work, the Unified Context Network (UniCon) and Extended UniCon, which are designed for robust scene-level ASD. We augment the architecture with a simple GRU-based module that allows information of recurring identities to flow across scenes through read and update operations. We report a best result of 94.47% mAP on the AVA-ActiveSpeaker test set, which continues to rank first on this year's challenge leaderboard and significantly pushes the state-of-the-art.
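To make the read and update operations concrete, the following is a minimal sketch of a per-identity memory driven by a GRU-style cell. It is not the authors' implementation: the class name `IdentityMemory`, the zero-initialised state for unseen identities, the random untrained weights, and the omission of bias terms are all assumptions made for illustration. The abstract only states that a GRU-based module carries information about recurring identities across scenes.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector (list of floats).
    return [sum(w * x for w, x in zip(row, v)) for row in W]


class IdentityMemory:
    """Hypothetical per-identity memory with GRU-style read/update.

    Each identity owns one state vector. `read` returns it before a scene
    is processed; `update` folds a scene-level feature into it afterwards.
    """

    def __init__(self, dim, seed=0):
        rng = random.Random(seed)

        def mat():
            # Untrained weights acting on the concatenation [x, h] (assumption).
            return [[rng.uniform(-0.1, 0.1) for _ in range(2 * dim)]
                    for _ in range(dim)]

        self.dim = dim
        self.Wz, self.Wr, self.Wh = mat(), mat(), mat()
        self.slots = {}

    def read(self, identity):
        # Unseen identities start from an all-zero state (assumption).
        return list(self.slots.get(identity, [0.0] * self.dim))

    def update(self, identity, x):
        h = self.read(identity)
        z = [sigmoid(a) for a in matvec(self.Wz, x + h)]   # update gate
        r = [sigmoid(a) for a in matvec(self.Wr, x + h)]   # reset gate
        rh = [ri * hi for ri, hi in zip(r, h)]
        cand = [math.tanh(a) for a in matvec(self.Wh, x + rh)]
        # Standard GRU interpolation between old state and candidate state.
        new_h = [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]
        self.slots[identity] = new_h
        return new_h


mem = IdentityMemory(dim=4)
before = mem.read("speaker_1")                         # zero state, first scene
after = mem.update("speaker_1", [1.0, 0.5, -0.5, 0.0])
```

In a later scene, calling `mem.read("speaker_1")` would return the stored state, which is one way information about a recurring identity could be reused across scenes.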
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| audio-visual-active-speaker-detection-on-ava | UniCon+ | validation mean average precision: 94.5% |