Natural Language Processing
主流 AI 模型在各任务上的性能指标比较,展示最前沿的技术水平
AI 模型性能基准
主流 AI 模型在各任务上的性能指标比较,展示最前沿的技术水平
Deep Clustering
50 篇论文 | 5 个基准测试
Semantic Dependency Parsing
50 篇论文 | 3 个基准测试
Word Alignment
50 篇论文 | 7 个基准测试
Few-Shot Text Classification
49 篇论文 | 8 个基准测试
Lemmatization
49 篇论文 | 0 个基准测试
Multimodal Deep Learning
49 篇论文 | 1 个基准测试
Punctuation Restoration
49 篇论文 | 0 个基准测试
Sentence Compression
49 篇论文 | 1 个基准测试
Sentence Ordering
49 篇论文 | 1 个基准测试
Graph-to-Sequence
48 篇论文 | 2 个基准测试
In-Context Learning
48 篇论文 | 0 个基准测试
Relation Extraction
48 篇论文 | 50 个基准测试
Review Generation
48 篇论文 | 0 个基准测试
Rumour Detection
48 篇论文 | 2 个基准测试
Chatbot
47 篇论文 | 1 个基准测试
Dialogue State Tracking
47 篇论文 | 7 个基准测试
Entity Disambiguation
47 篇论文 | 11 个基准测试
Grammatical Error Detection
47 篇论文 | 4 个基准测试
Lexical Normalization
47 篇论文 | 1 个基准测试
Lexical Simplification
47 篇论文 | 0 个基准测试
Semantic Parsing
47 篇论文 | 20 个基准测试
Text Categorization
47 篇论文 | 0 个基准测试
Conversational Response Selection
46 篇论文 | 15 个基准测试
Conversational Search
46 篇论文 | 0 个基准测试
Dialogue Management
46 篇论文 | 0 个基准测试
Document Summarization
46 篇论文 | 7 个基准测试
Goal-Oriented Dialogue Systems
46 篇论文 | 0 个基准测试
Hope Speech Detection
46 篇论文 | 2 个基准测试
Benchmarking
45 篇论文 | 2 个基准测试
Blocking
45 篇论文 | 5 个基准测试
Dependency Parsing
45 篇论文 | 15 个基准测试
Emotion-Cause Pair Extraction
45 篇论文 | 2 个基准测试
Empathetic Response Generation
45 篇论文 | 1 个基准测试
Extractive Text Summarization
45 篇论文 | 5 个基准测试
Generative Question Answering
45 篇论文 | 2 个基准测试
knowledge editing
45 篇论文 | 1 个基准测试
Sentence Embeddings
45 篇论文 | 0 个基准测试
Twitter Sentiment Analysis
45 篇论文 | 0 个基准测试
Decipherment
44 篇论文 | 0 个基准测试
GSM8K
44 篇论文 | 1 个基准测试
Lexical Complexity Prediction
44 篇论文 | 0 个基准测试
Morphological Tagging
44 篇论文 | 0 个基准测试
TAR
44 篇论文 | 0 个基准测试
Text Augmentation
44 篇论文 | 0 个基准测试
Automated Essay Scoring
43 篇论文 | 1 个基准测试
Chinese Word Segmentation
43 篇论文 | 6 个基准测试
Novelty Detection
43 篇论文 | 0 个基准测试
Prompt Engineering
43 篇论文 | 16 个基准测试
Sentence Embedding
43 篇论文 | 0 个基准测试
Sentence Summarization
42 篇论文 | 0 个基准测试
Answer Generation
42 篇论文 | 2 个基准测试
Arabic Sentiment Analysis
42 篇论文 | 0 个基准测试
Cross-Lingual NER
42 篇论文 | 28 个基准测试
Relation Classification
42 篇论文 | 8 个基准测试
Spoken Language Understanding
42 篇论文 | 5 个基准测试
Ad-Hoc Information Retrieval
41 篇论文 | 1 个基准测试
Event Extraction
41 篇论文 | 9 个基准测试
Learning with noisy labels
41 篇论文 | 20 个基准测试
Named Entity Recognition (NER)
41 篇论文 | 77 个基准测试
Reinforcement Learning
41 篇论文 | 21 个基准测试
Safety Alignment
41 篇论文 | 0 个基准测试
Aspect Extraction
40 篇论文 | 6 个基准测试
Dialogue Evaluation
40 篇论文 | 2 个基准测试
Hallucination Evaluation
40 篇论文 | 0 个基准测试
Multimodal Sentiment Analysis
40 篇论文 | 5 个基准测试
Continual Learning
39 篇论文 | 32 个基准测试
Dialogue Generation
39 篇论文 | 13 个基准测试
Distractor Generation
39 篇论文 | 1 个基准测试
Intent Discovery
39 篇论文 | 3 个基准测试
Knowledge Base Population
39 篇论文 | 1 个基准测试
Script Generation
39 篇论文 | 0 个基准测试
Semi-Supervised Text Classification
39 篇论文 | 2 个基准测试
Sequential Pattern Mining
39 篇论文 | 1 个基准测试
Sign Language Production
39 篇论文 | 0 个基准测试
Spelling Correction
39 篇论文 | 0 个基准测试
Text Infilling
39 篇论文 | 0 个基准测试
Abstractive Text Summarization
38 篇论文 | 18 个基准测试
Conversational Question Answering
38 篇论文 | 1 个基准测试
coreference-resolution
38 篇论文 | 0 个基准测试
Dialect Identification
38 篇论文 | 0 个基准测试
Discourse Parsing
38 篇论文 | 4 个基准测试
Discourse Segmentation
38 篇论文 | 0 个基准测试
Document AI
38 篇论文 | 1 个基准测试
Document Classification
38 篇论文 | 21 个基准测试
Entity Alignment
38 篇论文 | 10 个基准测试
Low Resource Named Entity Recognition
38 篇论文 | 3 个基准测试
Self-Learning
38 篇论文 | 0 个基准测试
Text Compression
38 篇论文 | 0 个基准测试
Toxic Spans Detection
38 篇论文 | 0 个基准测试
Emotion Recognition in Conversation
37 篇论文 | 16 个基准测试
Implicit Discourse Relation Classification
37 篇论文 | 0 个基准测试
Recipe Generation
37 篇论文 | 5 个基准测试
Sentence-Pair Classification
37 篇论文 | 0 个基准测试
Speech-to-Text Translation
37 篇论文 | 10 个基准测试
Temporal Relation Extraction
37 篇论文 | 1 个基准测试
Translation
37 篇论文 | 7 个基准测试
Bias Detection
36 篇论文 | 5 个基准测试
Hate Speech Detection
36 篇论文 | 15 个基准测试
Headline Generation
36 篇论文 | 1 个基准测试
Intent Classification
36 篇论文 | 4 个基准测试
Intent Recognition
36 篇论文 | 1 个基准测试
Language Modelling
36 篇论文 | 55 个基准测试
Multilingual Named Entity Recognition
36 篇论文 | 0 个基准测试
Multilingual NLP
36 篇论文 | 0 个基准测试
Phrase Grounding
36 篇论文 | 5 个基准测试
Question Generation
36 篇论文 | 8 个基准测试
Attribute Value Extraction
35 篇论文 | 4 个基准测试
Community Question Answering
35 篇论文 | 2 个基准测试
Emotion Classification
35 篇论文 | 9 个基准测试
Joint Entity and Relation Extraction
35 篇论文 | 16 个基准测试
Query-focused Summarization
35 篇论文 | 0 个基准测试
Text Style Transfer
35 篇论文 | 2 个基准测试
NER
34 篇论文 | 5 个基准测试
Column Type Annotation
34 篇论文 | 12 个基准测试
Image Deblurring
34 篇论文 | 9 个基准测试
Morphological Disambiguation
34 篇论文 | 0 个基准测试
Open-Ended Question Answering
34 篇论文 | 0 个基准测试
Question-Answer-Generation
34 篇论文 | 0 个基准测试
Reverse Dictionary
34 篇论文 | 0 个基准测试
Short Text Clustering
34 篇论文 | 8 个基准测试
Temporal Information Extraction
34 篇论文 | 2 个基准测试
Hypernym Discovery
33 篇论文 | 3 个基准测试
Knowledge Base Question Answering
33 篇论文 | 10 个基准测试
Language Identification
33 篇论文 | 6 个基准测试
Long-range modeling
33 篇论文 | 2 个基准测试
Low Resource NMT
33 篇论文 | 0 个基准测试
Morphological Inflection
33 篇论文 | 0 个基准测试
Native Language Identification
33 篇论文 | 1 个基准测试
Code Repair
32 篇论文 | 1 个基准测试
Document-level Event Extraction
32 篇论文 | 1 个基准测试
Entity Resolution
32 篇论文 | 11 个基准测试
Prepositional Phrase Attachment
32 篇论文 | 0 个基准测试
Suggestion mining
32 篇论文 | 0 个基准测试
Aspect Category Detection
31 篇论文 | 4 个基准测试
Clickbait Detection
31 篇论文 | 0 个基准测试
HellaSwag
31 篇论文 | 0 个基准测试
Passage Re-Ranking
31 篇论文 | 2 个基准测试
Table annotation
31 篇论文 | 0 个基准测试
Cross-Lingual Natural Language Inference
30 篇论文 | 4 个基准测试
Open-Domain Question Answering
30 篇论文 | 15 个基准测试
Unsupervised Extractive Summarization
30 篇论文 | 3 个基准测试
Word Sense Disambiguation
30 篇论文 | 15 个基准测试
Dialogue Understanding
29 篇论文 | 0 个基准测试
Keyphrase Generation
29 篇论文 | 1 个基准测试
LAMBADA
29 篇论文 | 1 个基准测试
Medical Named Entity Recognition
29 篇论文 | 2 个基准测试
Natural Language Inference
29 篇论文 | 37 个基准测试
Nested Named Entity Recognition
29 篇论文 | 6 个基准测试
Part-Of-Speech Tagging
29 篇论文 | 15 个基准测试
Reading Comprehension
29 篇论文 | 7 个基准测试
Argument Mining
28 篇论文 | 1 个基准测试
Coherence Evaluation
28 篇论文 | 2 个基准测试
Implicatures
28 篇论文 | 1 个基准测试
multimodal generation
28 篇论文 | 1 个基准测试
News Generation
28 篇论文 | 0 个基准测试
Stance Detection
28 篇论文 | 22 个基准测试
Active Learning
27 篇论文 | 1 个基准测试
Aggression Identification
27 篇论文 | 0 个基准测试
Definition Extraction
27 篇论文 | 0 个基准测试
Drug–drug Interaction Extraction
27 篇论文 | 3 个基准测试
Emotion Cause Extraction
27 篇论文 | 1 个基准测试
Entity Extraction using GAN
27 篇论文 | 0 个基准测试
Entity Linking
27 篇论文 | 27 个基准测试
Legal Reasoning
27 篇论文 | 2 个基准测试
Mamba
27 篇论文 | 0 个基准测试
Question Selection
27 篇论文 | 1 个基准测试
Temporal Relation Classification
27 篇论文 | 4 个基准测试
Toxic Comment Classification
27 篇论文 | 4 个基准测试
Transliteration
27 篇论文 | 0 个基准测试
Word Similarity
27 篇论文 | 1 个基准测试
Decoder
26 篇论文 | 0 个基准测试
Opinion Mining
26 篇论文 | 1 个基准测试
Pretrained Multilingual Language Models
26 篇论文 | 0 个基准测试
Question Rewriting
26 篇论文 | 0 个基准测试
Table-based Fact Verification
26 篇论文 | 1 个基准测试
Abstract Argumentation
25 篇论文 | 0 个基准测试
Cross-Lingual Document Classification
25 篇论文 | 10 个基准测试
Cross-Lingual Question Answering
25 篇论文 | 3 个基准测试
Deep Learning
25 篇论文 | 0 个基准测试
Diachronic Word Embeddings
25 篇论文 | 0 个基准测试
Event Causality Identification
25 篇论文 | 0 个基准测试
Low-Resource Neural Machine Translation
25 篇论文 | 1 个基准测试
Protein Folding
25 篇论文 | 0 个基准测试
Timeline Summarization
25 篇论文 | 1 个基准测试
Automatic Post-Editing
24 篇论文 | 0 个基准测试
CCG Supertagging
24 篇论文 | 1 个基准测试
Coreference Resolution
24 篇论文 | 16 个基准测试
Literature Mining
24 篇论文 | 0 个基准测试
Method name prediction
24 篇论文 | 1 个基准测试
Topic Models
24 篇论文 | 6 个基准测试
Unsupervised Dependency Parsing
24 篇论文 | 1 个基准测试
Chinese Named Entity Recognition
23 篇论文 | 7 个基准测试
Emotional Intelligence
23 篇论文 | 1 个基准测试
Few-Shot Relation Classification
23 篇论文 | 4 个基准测试
Image to Video Generation
23 篇论文 | 0 个基准测试
Semantic Retrieval
23 篇论文 | 1 个基准测试
Taxonomy Expansion
23 篇论文 | 0 个基准测试
Text-to-Image Generation
23 篇论文 | 13 个基准测试
Text-To-SQL
23 篇论文 | 10 个基准测试
Winogrande
23 篇论文 | 0 个基准测试
Abuse Detection
22 篇论文 | 0 个基准测试
Cross-Lingual Entity Linking
22 篇论文 | 0 个基准测试
Data-free Knowledge Distillation
22 篇论文 | 2 个基准测试
Dialog Act Classification
22 篇论文 | 1 个基准测试
Extract Aspect
22 篇论文 | 1 个基准测试
Extreme Summarization
22 篇论文 | 4 个基准测试
Scientific Document Summarization
22 篇论文 | 1 个基准测试
Short-Text Conversation
22 篇论文 | 0 个基准测试
Table Retrieval
22 篇论文 | 1 个基准测试
Text Retrieval
22 篇论文 | 16 个基准测试
Word Translation
22 篇论文 | 0 个基准测试
Cloze Test
21 篇论文 | 2 个基准测试
Constituency Grammar Induction
21 篇论文 | 1 个基准测试
Conversational Response Generation
21 篇论文 | 0 个基准测试
Cross Document Coreference Resolution
21 篇论文 | 0 个基准测试
KG-to-Text Generation
21 篇论文 | 11 个基准测试
Large Language Model
21 篇论文 | 2 个基准测试
Linguistic Acceptability
21 篇论文 | 5 个基准测试
Opinion Summarization
21 篇论文 | 0 个基准测试
Passage Ranking
21 篇论文 | 1 个基准测试
Text Clustering
21 篇论文 | 3 个基准测试
Zero-shot Slot Filling
21 篇论文 | 3 个基准测试
Dependency Grammar Induction
20 篇论文 | 2 个基准测试
Entity Typing
20 篇论文 | 8 个基准测试
Intent Detection
20 篇论文 | 19 个基准测试
Key Information Extraction
20 篇论文 | 6 个基准测试
LLM-generated Text Detection
20 篇论文 | 0 个基准测试
Paraphrase Identification
20 篇论文 | 11 个基准测试
Probing Language Models
20 篇论文 | 1 个基准测试
Specificity
20 篇论文 | 0 个基准测试
Text Anonymization
20 篇论文 | 0 个基准测试
Cross-Domain Named Entity Recognition
19 篇论文 | 1 个基准测试
Dynamic Topic Modeling
19 篇论文 | 0 个基准测试
Explanation Generation
19 篇论文 | 5 个基准测试
Fine-Grained Opinion Analysis
19 篇论文 | 1 个基准测试
Formality Style Transfer
19 篇论文 | 1 个基准测试
Linguistic steganography
19 篇论文 | 0 个基准测试
Low Resource Neural Machine Translation
19 篇论文 | 0 个基准测试
Multi-Hop Reading Comprehension
19 篇论文 | 0 个基准测试
Multi-Label Text Classification
19 篇论文 | 20 个基准测试
News Classification
19 篇论文 | 4 个基准测试
Relationship Extraction (Distant Supervised)
19 篇论文 | 2 个基准测试
text annotation
19 篇论文 | 0 个基准测试
Text-to-Video Generation
19 篇论文 | 6 个基准测试
Toponym Resolution
19 篇论文 | 0 个基准测试
XLM-R
19 篇论文 | 0 个基准测试
Aspect Category Sentiment Analysis
18 篇论文 | 1 个基准测试
Component Classification
18 篇论文 | 1 个基准测试
Data-to-Text Generation
18 篇论文 | 26 个基准测试
Event Relation Extraction
18 篇论文 | 0 个基准测试
Language Acquisition
18 篇论文 | 1 个基准测试
Story Generation
18 篇论文 | 5 个基准测试
Answer Selection
17 篇论文 | 6 个基准测试
Chinese Spell Checking
17 篇论文 | 1 个基准测试
Complex Word Identification
17 篇论文 | 0 个基准测试
Concept-To-Text Generation
17 篇论文 | 1 个基准测试
De-identification
17 篇论文 | 0 个基准测试
Gender Bias Detection
17 篇论文 | 0 个基准测试
Memorization
17 篇论文 | 1 个基准测试
nlg evaluation
17 篇论文 | 0 个基准测试
POS Tagging
17 篇论文 | 2 个基准测试
Semantic Role Labeling
17 篇论文 | 7 个基准测试
Topic coverage
17 篇论文 | 3 个基准测试
Vietnamese Datasets
17 篇论文 | 0 个基准测试
Visual Dialog
17 篇论文 | 8 个基准测试
Zero-Shot Stance Detection
17 篇论文 | 0 个基准测试
AMR Parsing
16 篇论文 | 8 个基准测试
Citation Intent Classification
16 篇论文 | 2 个基准测试
Conditional Text Generation
16 篇论文 | 1 个基准测试
Cross-Lingual Information Retrieval
16 篇论文 | 0 个基准测试
Embeddings Evaluation
16 篇论文 | 0 个基准测试
Fake News Detection
16 篇论文 | 10 个基准测试
Keyword Extraction
16 篇论文 | 3 个基准测试
Relational Reasoning
16 篇论文 | 1 个基准测试
Semantic Textual Similarity
16 篇论文 | 13 个基准测试
Story Completion
16 篇论文 | 0 个基准测试
Table-to-Text Generation
16 篇论文 | 8 个基准测试
Text Summarization
16 篇论文 | 37 个基准测试
Transition-Based Dependency Parsing
16 篇论文 | 0 个基准测试
Zero-Shot Text-to-Image Generation
16 篇论文 | 0 个基准测试
Abstract Meaning Representation
15 篇论文 | 0 个基准测试
Action Parsing
15 篇论文 | 1 个基准测试
Aspect-Based Sentiment Analysis (ABSA)
15 篇论文 | 18 个基准测试
Authorship Verification
15 篇论文 | 0 个基准测试
Continual Relation Extraction
15 篇论文 | 0 个基准测试
Dialogue Act Classification
15 篇论文 | 5 个基准测试
Language Modeling
15 篇论文 | 0 个基准测试
Machine Translation
15 篇论文 | 83 个基准测试
PICO
15 篇论文 | 1 个基准测试
Polyphone disambiguation
15 篇论文 | 1 个基准测试
Prosody Prediction
15 篇论文 | 1 个基准测试
Question Answering
15 篇论文 | 149 个基准测试
Temporal Tagging
15 篇论文 | 8 个基准测试
Aspect Term Extraction and Sentiment Classification
14 篇论文 | 1 个基准测试
Cross-Domain Text Classification
14 篇论文 | 0 个基准测试
Dialog Relation Extraction
14 篇论文 | 2 个基准测试
Fact Selection
14 篇论文 | 1 个基准测试
Implicit Relations
14 篇论文 | 1 个基准测试
Key Point Matching
14 篇论文 | 0 个基准测试
Profile Generation
14 篇论文 | 1 个基准测试
Semantic entity labeling
14 篇论文 | 2 个基准测试
Spam detection
14 篇论文 | 1 个基准测试
Table-based Question Answering
14 篇论文 | 0 个基准测试
Table Search
14 篇论文 | 0 个基准测试
Text Generation
14 篇论文 | 71 个基准测试
Automated Writing Evaluation
13 篇论文 | 0 个基准测试
Cell Entity Annotation
13 篇论文 | 5 个基准测试
Comment Generation
13 篇论文 | 0 个基准测试
Commonsense Causal Reasoning
13 篇论文 | 0 个基准测试
DRS Parsing
13 篇论文 | 2 个基准测试
Extractive Summarization
13 篇论文 | 0 个基准测试
Few-shot NER
13 篇论文 | 4 个基准测试
Long-Context Understanding
13 篇论文 | 5 个基准测试
Model Editing
13 篇论文 | 0 个基准测试
Parallel Corpus Mining
13 篇论文 | 0 个基准测试
Persian Sentiment Analysis
13 篇论文 | 0 个基准测试
RAG
13 篇论文 | 0 个基准测试
Text Classification
13 篇论文 | 85 个基准测试
UCCA Parsing
13 篇论文 | 2 个基准测试
Arabic Text Diacritization
12 篇论文 | 2 个基准测试
Causal Emotion Entailment
12 篇论文 | 1 个基准测试
Conversation Disentanglement
12 篇论文 | 3 个基准测试
Humor Detection
12 篇论文 | 1 个基准测试
Key-value Pair Extraction
12 篇论文 | 2 个基准测试
Negation Scope Resolution
12 篇论文 | 4 个基准测试
Predicate Detection
12 篇论文 | 3 个基准测试
Relevance Detection
12 篇论文 | 0 个基准测试
Sentence Pair Modeling
12 篇论文 | 0 个基准测试
Session Search
12 篇论文 | 0 个基准测试
Simultaneous Speech-to-Text Translation
12 篇论文 | 0 个基准测试
Unsupervised Text Classification
12 篇论文 | 4 个基准测试
Author Attribution
11 篇论文 | 0 个基准测试
Columns Property Annotation
11 篇论文 | 4 个基准测试
End-To-End Dialogue Modelling
11 篇论文 | 2 个基准测试
Hint Generation
11 篇论文 | 0 个基准测试
Mathematical Question Answering
11 篇论文 | 2 个基准测试
Multiple Choice Question Answering (MCQA)
11 篇论文 | 31 个基准测试
Nested Mention Recognition
11 篇论文 | 2 个基准测试
Paper generation
11 篇论文 | 2 个基准测试
Passage Retrieval
11 篇论文 | 6 个基准测试
Question Similarity
11 篇论文 | 1 个基准测试
Satire Detection
11 篇论文 | 0 个基准测试
Subjectivity Analysis
11 篇论文 | 2 个基准测试
Toponym Recognition
11 篇论文 | 0 个基准测试
Vietnamese Word Segmentation
11 篇论文 | 0 个基准测试
Zero-Shot Cross-Lingual Transfer
11 篇论文 | 2 个基准测试
Zero-shot Named Entity Recognition (NER)
11 篇论文 | 4 个基准测试
Abusive Language
10 篇论文 | 0 个基准测试
Chunking
10 篇论文 | 5 个基准测试
Cross-Lingual Semantic Textual Similarity
10 篇论文 | 0 个基准测试
Document Ranking
10 篇论文 | 2 个基准测试
Lay Summarization
10 篇论文 | 2 个基准测试
Multi-modal Named Entity Recognition
10 篇论文 | 5 个基准测试
Natural Language Understanding
10 篇论文 | 6 个基准测试
Open-Domain Dialog
10 篇论文 | 1 个基准测试
Semantic Composition
10 篇论文 | 0 个基准测试
Semantic Shift Detection
10 篇论文 | 0 个基准测试
Simultaneous Speech-to-Speech Translation
10 篇论文 | 0 个基准测试
Only Connect Walls Dataset Task 1 (Grouping)
10 篇论文 | 1 个基准测试
Text Simplification
10 篇论文 | 11 个基准测试
Variable Detection
10 篇论文 | 1 个基准测试
Zero-shot Event Extraction
10 篇论文 | 0 个基准测试
AI Agent
9 篇论文 | 0 个基准测试
answerability prediction
9 篇论文 | 1 个基准测试
Binary Relation Extraction
9 篇论文 | 2 个基准测试
Bridging Anaphora Resolution
9 篇论文 | 0 个基准测试
Chinese Zero Pronoun Resolution
9 篇论文 | 0 个基准测试
Connective Detection
9 篇论文 | 0 个基准测试
Document Dating
9 篇论文 | 2 个基准测试
Image-guided Story Ending Generation
9 篇论文 | 2 个基准测试
molecular representation
9 篇论文 | 0 个基准测试
Response Generation
9 篇论文 | 3 个基准测试
Sentiment Analysis
9 篇论文 | 42 个基准测试
Unsupervised Opinion Summarization
9 篇论文 | 3 个基准测试
Vietnamese Social Media Text Processing
9 篇论文 | 0 个基准测试
Author Profiling
8 篇论文 | 0 个基准测试
Belebele
8 篇论文 | 0 个基准测试
Bilingual Lexicon Induction
8 篇论文 | 0 个基准测试
Cross-Lingual Word Embeddings
8 篇论文 | 0 个基准测试
Definition Modelling
8 篇论文 | 0 个基准测试
Dialog Learning
8 篇论文 | 0 个基准测试
Emotion Recognition in Context
8 篇论文 | 4 个基准测试
Grammatical Error Correction
8 篇论文 | 13 个基准测试
Handwritten Chinese Text Recognition
8 篇论文 | 0 个基准测试
Multi-agent Integration
8 篇论文 | 1 个基准测试
Offline Handwritten Chinese Character Recognition
8 篇论文 | 0 个基准测试
Paraphrase Generation
8 篇论文 | 3 个基准测试
Sarcasm Detection
8 篇论文 | 9 个基准测试
Spatial Reasoning
8 篇论文 | 2 个基准测试
Summarization
8 篇论文 | 12 个基准测试
target-oriented opinion words extraction
8 篇论文 | 0 个基准测试
Thai Word Segmentation
8 篇论文 | 2 个基准测试
Unsupervised Sentence Summarization
8 篇论文 | 0 个基准测试
User Simulation
8 篇论文 | 0 个基准测试
Vietnamese Hate Speech Detection
8 篇论文 | 0 个基准测试
WNLI
8 篇论文 | 0 个基准测试
Zero-Shot Machine Translation
8 篇论文 | 0 个基准测试
Aspect-oriented Opinion Extraction
7 篇论文 | 1 个基准测试
Code Documentation Generation
7 篇论文 | 7 个基准测试
Contextualised Word Representations
7 篇论文 | 0 个基准测试
Dialogue Rewriting
7 篇论文 | 3 个基准测试
Few-Shot Stance Detection
7 篇论文 | 0 个基准测试
Image Segmentation
7 篇论文 | 12 个基准测试
Japanese Word Segmentation
7 篇论文 | 1 个基准测试
Meme Classification
7 篇论文 | 3 个基准测试
Occupation prediction
7 篇论文 | 0 个基准测试
Open Intent Discovery
7 篇论文 | 6 个基准测试
Privacy Preserving Deep Learning
7 篇论文 | 0 个基准测试
Propaganda detection
7 篇论文 | 0 个基准测试
Propaganda span identification
7 篇论文 | 0 个基准测试
Query-Based Extractive Summarization
7 篇论文 | 1 个基准测试
Slot Filling
7 篇论文 | 14 个基准测试
SNARKS
7 篇论文 | 0 个基准测试
Text Attribute Transfer
7 篇论文 | 0 个基准测试
Timex normalization
7 篇论文 | 2 个基准测试
Vietnamese Visual Question Answering
7 篇论文 | 0 个基准测试
Word Sense Induction
7 篇论文 | 1 个基准测试
Aspect-Category-Opinion-Sentiment Quadruple Extraction
6 篇论文 | 2 个基准测试
Aspect Category Polarity
6 篇论文 | 1 个基准测试
Cognate Prediction
6 篇论文 | 0 个基准测试
Cross-Lingual Bitext Mining
6 篇论文 | 4 个基准测试
Deep Attention
6 篇论文 | 0 个基准测试
Equation Discovery
6 篇论文 | 0 个基准测试
Fact Verification
6 篇论文 | 3 个基准测试
Grounded language learning
6 篇论文 | 0 个基准测试
Information Retrieval
6 篇论文 | 34 个基准测试
Math Word Problem Solving
6 篇论文 | 13 个基准测试
Mathematical Reasoning
6 篇论文 | 11 个基准测试
Morpheme Segmentaiton
6 篇论文 | 1 个基准测试
News Annotation
6 篇论文 | 0 个基准测试
Open Intent Detection
6 篇论文 | 17 个基准测试
Selection bias
6 篇论文 | 0 个基准测试
Syntax Representation
6 篇论文 | 0 个基准测试
Task-Completion Dialogue Policy Learning
6 篇论文 | 0 个基准测试
Temporal/Casual QA
6 篇论文 | 1 个基准测试
Term Extraction
6 篇论文 | 2 个基准测试
text-to-Cypher
6 篇论文 | 0 个基准测试
Vietnamese Language Models
6 篇论文 | 0 个基准测试
Zero-shot Sentiment Classification
6 篇论文 | 1 个基准测试
Argument Pair Extraction (APE)
5 篇论文 | 1 个基准测试
Binary Condescension Detection
5 篇论文 | 1 个基准测试
Continual Named Entity Recognition
5 篇论文 | 0 个基准测试
Cross-Lingual Transfer
5 篇论文 | 1 个基准测试
Dialogue Interpretation
5 篇论文 | 0 个基准测试
Drug Design
5 篇论文 | 0 个基准测试
DrugProt
5 篇论文 | 1 个基准测试
Job classification
5 篇论文 | 0 个基准测试
Job Prediction
5 篇论文 | 0 个基准测试
Lexical Analysis
5 篇论文 | 0 个基准测试
Long Form Question Answering
5 篇论文 | 0 个基准测试
Multi-label Condescension Detection
5 篇论文 | 1 个基准测试
Multimodal Machine Translation
5 篇论文 | 3 个基准测试
Named Entity Recognition In Vietnamese
5 篇论文 | 2 个基准测试
Personality Alignment
5 篇论文 | 0 个基准测试
Reading Order Detection
5 篇论文 | 2 个基准测试
Riddle Sense
5 篇论文 | 2 个基准测试
Scientific Results Extraction
5 篇论文 | 2 个基准测试
Stereotypical Bias Analysis
5 篇论文 | 1 个基准测试
Text Effects Transfer
5 篇论文 | 0 个基准测试
Unsupervised Part-Of-Speech Tagging
5 篇论文 | 0 个基准测试
Vietnamese Image Captioning
5 篇论文 | 0 个基准测试
Zero-shot Relation Triplet Extraction
5 篇论文 | 2 个基准测试
Abstract Anaphora Resolution
4 篇论文 | 1 个基准测试
Attribute Mining
4 篇论文 | 3 个基准测试
Authorship Attribution
4 篇论文 | 0 个基准测试
Bangla Spelling Error Correction
4 篇论文 | 1 个基准测试
Chemical Indexing
4 篇论文 | 1 个基准测试
Class-level Code Generation
4 篇论文 | 1 个基准测试
Cross-lingual zero-shot dependency parsing
4 篇论文 | 1 个基准测试
Chinese Spelling Error Correction
4 篇论文 | 0 个基准测试
Document-level Relation Extraction
4 篇论文 | 3 个基准测试
Emotional Dialogue Acts
4 篇论文 | 0 个基准测试
Empirical Judgments
4 篇论文 | 1 个基准测试
Extracting COVID-19 Events from Twitter
4 篇论文 | 1 个基准测试
Face Selection
4 篇论文 | 0 个基准测试
Goal-Oriented Dialog
4 篇论文 | 1 个基准测试
Hope Speech Detection for Tamil
4 篇论文 | 1 个基准测试
Information Threading
4 篇论文 | 2 个基准测试
Instruction Following
4 篇论文 | 1 个基准测试
Interactive Evaluation of Dialog
4 篇论文 | 1 个基准测试
Joint Multilingual Sentence Representations
4 篇论文 | 0 个基准测试
Logical Reasoning Question Answering
4 篇论文 | 1 个基准测试
Logical Reasoning Reading Comprehension
4 篇论文 | 0 个基准测试
Misogynistic Aggression Identification
4 篇论文 | 0 个基准测试
Multimodal Attribute Value Extraction
4 篇论文 | 0 个基准测试
Open Information Extraction
4 篇论文 | 13 个基准测试
Page Stream Segmentation
4 篇论文 | 0 个基准测试
Personality Generation
4 篇论文 | 0 个基准测试
Reliable Intelligence Identification
4 篇论文 | 0 个基准测试
Semantic Role Labeling (predicted predicates)
4 篇论文 | 2 个基准测试
Speculation Detection
4 篇论文 | 0 个基准测试
Text-Based Stock Prediction
4 篇论文 | 0 个基准测试
Text-to-video search
4 篇论文 | 0 个基准测试
Timedial
4 篇论文 | 1 个基准测试
Twitter Event Detection
4 篇论文 | 1 个基准测试
Unsupervised Sentence Compression
4 篇论文 | 0 个基准测试
Unsupervised semantic parsing
4 篇论文 | 2 个基准测试
Vietnamese Fact Checking
4 篇论文 | 0 个基准测试
Vietnamese Speech Recognition
4 篇论文 | 0 个基准测试
AI and Safety
3 篇论文 | 0 个基准测试
Aspect Category Sentiment Classification
3 篇论文 | 0 个基准测试
Aspect-Sentiment-Opinion Triplet Extraction
3 篇论文 | 1 个基准测试
Constituency Parsing
3 篇论文 | 4 个基准测试
Conversational Web Navigation
3 篇论文 | 1 个基准测试
Dark Humor Detection
3 篇论文 | 1 个基准测试
Data Mining
3 篇论文 | 0 个基准测试
Dialogue Safety Prediction
3 篇论文 | 2 个基准测试
Disambiguation QA
3 篇论文 | 0 个基准测试
Discourse Marker Prediction
3 篇论文 | 1 个基准测试
Domain Labelling
3 篇论文 | 1 个基准测试
End-to-End RST Parsing
3 篇论文 | 1 个基准测试
English Proverbs
3 篇论文 | 1 个基准测试
Extract aspect-polarity tuple
3 篇论文 | 1 个基准测试
Few-shot HTC
3 篇论文 | 0 个基准测试
Formal Fallacies Syllogisms Negation
3 篇论文 | 0 个基准测试
Hate Speech Normalization
3 篇论文 | 0 个基准测试
Hyperbaton
3 篇论文 | 0 个基准测试
image-sentence alignment
3 篇论文 | 12 个基准测试
Information Extraction
3 篇论文 | 1 个基准测试
KB-to-Language Generation
3 篇论文 | 1 个基准测试
Meme Captioning
3 篇论文 | 0 个基准测试
Memex Question Answering
3 篇论文 | 1 个基准测试
Multi-modal Dialogue Generation
3 篇论文 | 1 个基准测试
Negation Detection
3 篇论文 | 0 个基准测试
Personality Recognition in Conversation
3 篇论文 | 1 个基准测试
Phrase Ranking
3 篇论文 | 2 个基准测试
Phrase Relatedness
3 篇论文 | 1 个基准测试
Phrase Tagging
3 篇论文 | 2 个基准测试
Political Salient Issue Orientation Detection
3 篇论文 | 1 个基准测试
Poll Generation
3 篇论文 | 1 个基准测试
Recognizing Emotion Cause in Conversations
3 篇论文 | 2 个基准测试
Record linking
3 篇论文 | 0 个基准测试
Relational Captioning
3 篇论文 | 1 个基准测试
Ruin Names
3 篇论文 | 0 个基准测试
Sentence Classification
3 篇论文 | 6 个基准测试
Sentence Embeddings For Biomedical Texts
3 篇论文 | 2 个基准测试
Social Media Mental Health Detection
3 篇论文 | 0 个基准测试
Sonnet Generation
3 篇论文 | 0 个基准测试
Speculation Scope Resolution
3 篇论文 | 3 个基准测试
Turning Point Identification
3 篇论文 | 0 个基准测试
Vietnamese Aspect-Based Sentiment Analysis
3 篇论文 | 0 个基准测试
Vietnamese Natural Language Understanding
3 篇论文 | 0 个基准测试
Vietnamese Scene Text
3 篇论文 | 0 个基准测试
Vietnamese Sentiment Analysis
3 篇论文 | 0 个基准测试
4-ary Relation Extraction
2 篇论文 | 1 个基准测试
ArabicMMLU
2 篇论文 | 0 个基准测试
Automatic Writing
2 篇论文 | 0 个基准测试
Claim-Evidence Pair Extraction (CEPE)
2 篇论文 | 1 个基准测试
Claim Extraction with Stance Classification (CESC)
2 篇论文 | 1 个基准测试
Clinical Information Retreival
2 篇论文 | 0 个基准测试
Clinical Language Translation
2 篇论文 | 0 个基准测试
Clinical Section Identification
2 篇论文 | 1 个基准测试
Collaborative Plan Acquisition
2 篇论文 | 0 个基准测试
Context Query Reformulation
2 篇论文 | 0 个基准测试
Croatian Text Diacritization
2 篇论文 | 1 个基准测试
Cross-lingual Text-to-Image Generation
2 篇论文 | 0 个基准测试
Czech Text Diacritization
2 篇论文 | 1 个基准测试
Description-guided molecule generation
2 篇论文 | 1 个基准测试
Document-level Closed Information Extraction
2 篇论文 | 3 个基准测试
Document-level RE with incomplete labeling
2 篇论文 | 2 个基准测试
Email Thread Summarization
2 篇论文 | 2 个基准测试
Event-Driven Trading
2 篇论文 | 0 个基准测试
Fantasy Reasoning
2 篇论文 | 1 个基准测试
few-shot-htc
2 篇论文 | 0 个基准测试
Figure Of Speech Detection
2 篇论文 | 1 个基准测试
French Text Diacritization
2 篇论文 | 1 个基准测试
GRE Reading Comprehension
2 篇论文 | 1 个基准测试
Hate Span Identification
2 篇论文 | 0 个基准测试
Hidden Aspect Detection
2 篇论文 | 0 个基准测试
Hierarchical Text Classification of Blurbs (GermEval 2019)
2 篇论文 | 1 个基准测试
Hierarchical Text Clustering
2 篇论文 | 0 个基准测试
Hope Speech Detection for English
2 篇论文 | 1 个基准测试
Hope Speech Detection for Malayalam
2 篇论文 | 1 个基准测试
Hungarian Text Diacritization
2 篇论文 | 1 个基准测试
Hyper-Relational Extraction
2 篇论文 | 1 个基准测试
Image-to-Text Retrieval
2 篇论文 | 8 个基准测试
incongruity detection
2 篇论文 | 0 个基准测试
Intrusion Detection
2 篇论文 | 5 个基准测试
Irish Text Diacritization
2 篇论文 | 1 个基准测试
Irony Identification
2 篇论文 | 1 个基准测试
Keyphrase Extraction
2 篇论文 | 6 个基准测试
Latvian Text Diacritization
2 篇论文 | 1 个基准测试
legal outcome extraction
2 篇论文 | 0 个基准测试
Machine Reading Comprehension
2 篇论文 | 4 个基准测试
Math Information Retrieval
2 篇论文 | 1 个基准测试
Molecular description generation
2 篇论文 | 0 个基准测试
Movie Dialog Same Or Different
2 篇论文 | 1 个基准测试
Multi-Document Summarization
2 篇论文 | 5 个基准测试
Multi-lingual Text-to-Image Generation
2 篇论文 | 0 个基准测试
multilingual cross-modal retrieval
2 篇论文 | 0 个基准测试
Multilingual Paraphrase Generation
2 篇论文 | 0 个基准测试
Multimodal Abstractive Text Summarization
2 篇论文 | 1 个基准测试
Multimodal Lexical Translation
2 篇论文 | 4 个基准测试
Natural Language Transduction
2 篇论文 | 0 个基准测试
Negation and Speculation Cue Detection
2 篇论文 | 2 个基准测试
Negation and Speculation Scope resolution
2 篇论文 | 0 个基准测试
Nonsense Words Grammar
2 篇论文 | 1 个基准测试
Open Relation Modeling
2 篇论文 | 0 个基准测试
Personalized and Emotional Conversation
2 篇论文 | 1 个基准测试
Political evalutation
2 篇论文 | 0 个基准测试
RACE-h
2 篇论文 | 1 个基准测试
RACE-m
2 篇论文 | 1 个基准测试
Reader-Aware Summarization
2 篇论文 | 1 个基准测试
Role-filler Entity Extraction
2 篇论文 | 1 个基准测试
Romanian Text Diacritization
2 篇论文 | 1 个基准测试
Scientific Concept Extraction
2 篇论文 | 1 个基准测试
Semantic Similarity
2 篇论文 | 26 个基准测试
SemEval-2022 Task 4-1 (Binary PCL Detection)
2 篇论文 | 1 个基准测试
SemEval-2022 Task 4-2 (Multi-label PCL Detection)
2 篇论文 | 1 个基准测试
Semi-Supervised Text Regression
2 篇论文 | 0 个基准测试
Sensitivity Classification
2 篇论文 | 1 个基准测试
Sentiment Dependency Learning
2 篇论文 | 0 个基准测试
Sketch-to-text Generation
2 篇论文 | 0 个基准测试
Slovak Text Diacritization
2 篇论文 | 1 个基准测试
Spanish Text Diacritization
2 篇论文 | 1 个基准测试
SSTOD
2 篇论文 | 2 个基准测试
Task-Oriented Dialogue Systems
2 篇论文 | 4 个基准测试
Text Matching
2 篇论文 | 0 个基准测试
Text-to-GQL
2 篇论文 | 0 个基准测试
Text-Variation
2 篇论文 | 0 个基准测试
Textual Analogy Parsing
2 篇论文 | 0 个基准测试
True or False Question Answering
2 篇论文 | 0 个基准测试
trustable and focussed LLM generated content
2 篇论文 | 0 个基准测试
Turkish Text Diacritization
2 篇论文 | 1 个基准测试
Understanding Fables
2 篇论文 | 1 个基准测试
Unsupervised KG-to-Text Generation
2 篇论文 | 4 个基准测试
Unsupervised Machine Translation
2 篇论文 | 9 个基准测试
ValNov
2 篇论文 | 2 个基准测试
Vietnamese Parsing
2 篇论文 | 0 个基准测试
Vietnamese Text Diacritization
2 篇论文 | 1 个基准测试
Visual Commonsense Tests
2 篇论文 | 1 个基准测试
Workflow Discovery
2 篇论文 | 1 个基准测试
Alignement visualisation
1 篇论文 | 0 个基准测试
Anaphora Resolution
1 篇论文 | 0 个基准测试
ARQMath2
1 篇论文 | 0 个基准测试
Aspect Sentiment Triplet Extraction
1 篇论文 | 4 个基准测试
Bangla Text Detection
1 篇论文 | 1 个基准测试
Blackout Poetry Generation
1 篇论文 | 1 个基准测试
Catalog Extraction
1 篇论文 | 1 个基准测试
Cause-Effect Relation Classification
1 篇论文 | 0 个基准测试
Chinese
1 篇论文 | 0 个基准测试
Clinical Assertion Status Detection
1 篇论文 | 1 个基准测试
Coding Problem Tagging
1 篇论文 | 0 个基准测试
Commonsense Reasoning for RL
1 篇论文 | 1 个基准测试
Complaint Comment Classification
1 篇论文 | 0 个基准测试
Context-specific Spam Detection
1 篇论文 | 1 个基准测试
Contextualized Literature-based Discovery
1 篇论文 | 0 个基准测试
Controllable Language Modelling
1 篇论文 | 0 个基准测试
Conversational Sentiment Quadruple Extraction
1 篇论文 | 2 个基准测试
Counterspeech Detection
1 篇论文 | 1 个基准测试
Cross-Document Language Modeling
1 篇论文 | 2 个基准测试
Cross-Language Text Summarization
1 篇论文 | 0 个基准测试
Cross-Lingual
1 篇论文 | 0 个基准测试
Crowdsourced Text Aggregation
1 篇论文 | 2 个基准测试
Detection of potentially void clauses
1 篇论文 | 1 个基准测试
Dialogue
1 篇论文 | 1 个基准测试
Direct NMT
1 篇论文 | 0 个基准测试
Emergent communications on relations
1 篇论文 | 0 个基准测试
Emotion Detection and Trigger Summarization
1 篇论文 | 0 个基准测试
Entity Typing on DH-KGs
1 篇论文 | 0 个基准测试
Extractive Tags Summarization
1 篇论文 | 0 个基准测试
Fact-based Text Editing
1 篇论文 | 2 个基准测试
FG-1-PG-1
1 篇论文 | 3 个基准测试
Figurative Language Visualization
1 篇论文 | 0 个基准测试
Genetic IE
1 篇论文 | 0 个基准测试
GermEval2024 Shared Task 1 Subtask 1
1 篇论文 | 1 个基准测试
GermEval2024 Shared Task 1 Subtask 2
1 篇论文 | 1 个基准测试
Grapheme Detection
1 篇论文 | 0 个基准测试
Grounded Open Vocabulary Acquisition
1 篇论文 | 0 个基准测试
Hate Intensity Prediction
1 篇论文 | 0 个基准测试
Hate Speech Detection CrisisHateMM Benchmark
1 篇论文 | 0 个基准测试
Hurtful Sentence Completion
1 篇论文 | 1 个基准测试
Joint Entity and Relation Extraction on Scientific Data
1 篇论文 | 0 个基准测试
Joint NER and Classification
1 篇论文 | 0 个基准测试
Latent Aspect Detection
1 篇论文 | 0 个基准测试
Legal Document Translation
1 篇论文 | 0 个基准测试
Line Items Extraction
1 篇论文 | 0 个基准测试
Link prediction on DH-KGs
1 篇论文 | 1 个基准测试
Medical question pair similarity computation
1 篇论文 | 0 个基准测试
Meeting Summarization
1 篇论文 | 2 个基准测试
Metric-Type Identification
1 篇论文 | 0 个基准测试
MMSQL performance
1 篇论文 | 1 个基准测试
Morphological Analysis
1 篇论文 | 0 个基准测试
Multi-Dialect Vietnamese
1 篇论文 | 0 个基准测试
Multi-Grained Named Entity Recognition
1 篇论文 | 0 个基准测试
Multi-Labeled Relation Extraction
1 篇论文 | 0 个基准测试
multi-word expression embedding
1 篇论文 | 0 个基准测试
multi-word expression sememe prediction
1 篇论文 | 0 个基准测试
Multilingual Machine Comprehension in English Hindi
1 篇论文 | 1 个基准测试
Multimedia Generative Script Learning
1 篇论文 | 0 个基准测试
Multimodal GIF Dialog
1 篇论文 | 1 个基准测试
Multimodal Text Prediction
1 篇论文 | 1 个基准测试
Multiview Contextual Commonsense Inference
1 篇论文 | 2 个基准测试
Multlingual Neural Machine Translation
1 篇论文 | 0 个基准测试
Natural Language Landmark Navigation Instructions Generation
1 篇论文 | 1 个基准测试
Open-World Social Event Classification
1 篇论文 | 0 个基准测试
Overlapping Mention Recognition
1 篇论文 | 0 个基准测试
Pcl Detection
1 篇论文 | 0 个基准测试
Persona Dialogue in Story
1 篇论文 | 1 个基准测试
Phrase Vector Embedding
1 篇论文 | 0 个基准测试
Poem meters classification
1 篇论文 | 1 个基准测试
Problem-Solving Deliberation
1 篇论文 | 1 个基准测试
Pronunciation Dictionary Creation
1 篇论文 | 0 个基准测试
Propaganda technique identification
1 篇论文 | 0 个基准测试
quantum circuit classification (classical ML)
1 篇论文 | 0 个基准测试
Query Wellformedness
1 篇论文 | 1 个基准测试
Question-Answer categorization
1 篇论文 | 1 个基准测试
Question Quality Assessment
1 篇论文 | 2 个基准测试
Question to Declarative Sentence
1 篇论文 | 0 个基准测试
Readability optimization
1 篇论文 | 0 个基准测试
relation explanation
1 篇论文 | 0 个基准测试
Relation Mention Extraction
1 篇论文 | 0 个基准测试
Row Annotation
1 篇论文 | 1 个基准测试
Rules-of-thumb Generation
1 篇论文 | 0 个基准测试
Semi-Supervised Formality Style Transfer
1 篇论文 | 0 个基准测试
Speaker Attribution in German Parliamentary Debates (GermEval 2023, subtask 1)
1 篇论文 | 1 个基准测试
Stance Detection (US Election 2020 - Biden)
1 篇论文 | 1 个基准测试
Stance Detection (US Election 2020 - Trump)
1 篇论文 | 1 个基准测试
Summarization Consistency Evaluation
1 篇论文 | 1 个基准测试
Table Type Detection
1 篇论文 | 1 个基准测试
Only Connect Walls Dataset Task 2 (Connections)
1 篇论文 | 0 个基准测试
Taxonomy Learning
1 篇论文 | 0 个基准测试
Text-to-CQL
1 篇论文 | 0 个基准测试
Traditional Spam Detection
1 篇论文 | 1 个基准测试
Tweet-Reply Sentiment Analysis
1 篇论文 | 1 个基准测试
Variable Disambiguation
1 篇论文 | 1 个基准测试
Vietnamese Lexical Normalization
1 篇论文 | 0 个基准测试
Vietnamese Multimodal Sentiment Analysis
1 篇论文 | 0 个基准测试
Visual Storytelling
1 篇论文 | 1 个基准测试
Weakly Supervised Data Denoising
1 篇论文 | 0 个基准测试
Web Page Tagging
1 篇论文 | 0 个基准测试
Word Attribute Transfer
1 篇论文 | 0 个基准测试
Zero-Shot Out-of-Domain Detection
1 篇论文 | 0 个基准测试