Lipreading on LRW-1000
Evaluation Metric
Top-1 Accuracy
Evaluation Results
Performance of each model on this benchmark
| Model Name | Top-1 Accuracy (%) | Paper Title | Repository |
| --- | --- | --- | --- |
| SyncVSR (Word Boundary) | 58.2 | SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization | |
| 3D-ResNet + Bi-GRU + MixUp + Label Smooth + Cosine LR (Word Boundary) | 55.7 | Learn an Effective Lip Reading Model without Pains | |
| 3D Conv + ResNet-18 + MS-TCN + Multi-Head Visual-Audio Memory | 53.8 | Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading | |
| 3D Conv + ResNet-18 + Bi-GRU + Visual-Audio Memory | 50.82 | Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video | - |
| 3D-ResNet + Bi-GRU + MixUp + Label Smooth + Cosine LR | 48.3 | Learn an Effective Lip Reading Model without Pains | |
| 3D Conv + ResNet-18 + Bi-GRU (Face Cutout) | 45.24 | Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition | |
| DFTN | 41.93 | Deformation Flow Based Two-Stream Network for Lip Reading | |
| GLMIM | 38.79 | Mutual Information Maximization for Effective Lip Reading | |
| PCPG | 38.7 | Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading | - |