KorQuAD 1.0

The Korean Question Answering Dataset




What is KorQuAD 1.0?


KorQuAD 1.0은 한국어 Machine Reading Comprehension을 위해 만든 데이터셋입니다. 모든 질의에 대한 답변은 해당 Wikipedia article 문단의 일부 하위 영역으로 이루어집니다. Stanford Question Answering Dataset(SQuAD) v1.0과 동일한 방식으로 구성되었습니다.






Getting Started


KorQuAD 1.0의 전체 데이터는 1,560 개의 Wikipedia article에 대해 10,645 건의 문단과 66,181 개의 질의응답 쌍으로, Training set 60,407 개, Dev set 5,774 개의 질의응답쌍으로 구분하였습니다.

KorQuAD 1.0의 데이터셋은 CC BY-ND 2.0 KR 라이센스를 따릅니다.

또한 Codalab을 통한 모델 제출시 테스트 스코어 계산 및 리더보드를 통한 스코어 공개에 동의한 것으로 간주합니다. 참고로 제출한 모델 및 소스 코드 등에 대해서는 참가자가 직접 라이센스를 부여하고 이를 명시할 경우 그에 따릅니다.




모델을 평가하기 위한 공식적인 evaluation script와 입력 샘플 prediction 파일을 제공합니다. 평가를 실행하려면 python evaluate-korquad_v1.0.py [path_to_dev-v1.0] [path_to_predictions] 를 입력하세요.




Dev set에 대해 만족하는 모델을 만들었다면 공식 점수를 얻고 leaderboard에 올리기 위해 모델을 제줄하세요. 무분별한 제출을 방지하는 차원에서 일주일에 하나의 모델을 제출하는 것으로 제한합니다. 테스트 결과의 무결성을 위하여 Test set은 공개되지 않습니다. 대신 모델을 제출하여 Test set에서 실행할 수 있도록 해야 합니다. 다음은 모델의 공식적인 평가를 위한 과정 안내 튜토리얼입니다.






Leaderboard


KorQuAD 1.0의 Test set으로 평가한 Exact Match(EM) 및 F1 score 입니다.


Rank Reg. Date Model EM F1
- 2018.10.17 Human Performance 80.17 91.20
1 2023.06.27 EXAONE-LM-v1.0 (single model)

LG AI Research

89.71 96.23
2 2024.02.02 MoBERT-Large V2.0 (single model, 355M)

ETRI XAI-NLP Team

89.05 95.92
3 2022.12.13 VAIV AI

VAIV Company AI Lab (Kisu Yang)

88.28 95.79
4 2023.08.25 MoBERT-Large V1.0 (single model, 355M)

ETRI XAI-NLP Team

88.56 95.66
5 2020.08.24 SDS-XFormer+ (single model)

Samsung SDS AI Research

88.10 95.57
6 2022.03.18 HAIQV-LM-Large V1.0 (single model)

Hanwha Systems/ICT NLP Part

87.71 95.39
7 2020.07.13 LGSP-LM-Large V2.0

LG AI NLP Team

87.46 95.39
8 2021.11.03 SkERT-Large 2.0.0 (ensemble)

Skelter Labs

87.94 95.25
9 2021.12.02 InfoLab KorLM v0.4 (single model)

KAIST InfoLab

88.17 95.24
10 2021.12.07 SkERT-Large 2.0.1 (ensemble)

Skelter Labs

87.58 95.18
11 2021.09.09 InfoLab KorLM v0.3

KAIST InfoLab

87.79 95.16
12 2020.01.08 SkERT-Large (single model)

Skelter Labs

87.66 95.15
13 2020.11.09 Americano (single)

SK Planet RB Dialogue Team(JunSeok Kim)

86.81 95.13
14 2020.07.13 BERT (single model)

Anonymous

86.99 95.12
15 2021.04.08 Summer is coming 1.1 (single model)

Anonymous

87.84 95.08
16 2020.09.08 Tubu the Destroyer 1.2 (single model)

Anonymous

87.87 95.06
17 2023.03.18 LDCC-LM (single model)

Lotte Data Communication AI Technical Team (Wonchul Kim)

87.17 95.04
18 2019.10.25 KorBERT-Large v1.0

ETRI ExoBrain Team

87.76 95.02
19 2021.06.02 SF-Xformer-Large (single model)

Samsung Finance AI Center

87.07 94.82
20 2021.03.29 Summer is coming 1.0 (single model)

Anonymous

86.81 94.81
21 2020.07.08 BERT (single model)

Anonymous

86.45 94.78
22 2021.05.20 InfoLab KorLM v0.2

KAIST InfoLab

87.48 94.77
23 2020.01.07 SkERT-LARGE (single model)

Skelter Labs

87.25 94.75
24 2019.06.26 LaRva-Kor-Large+ + CLaF (single)

Clova AI LaRva Team

86.84 94.75
25 2022.05.03 APplus (single model)

ActionPower

86.92 94.71
26 2020.01.03 SkERT Large (single model)

Skelter Labs

87.28 94.66
27 2021.12.11 mT5-Large v1.0 (single model)

Everdoubling & AISchool

87.38 94.65
28 2019.06.04 BERT-CLKT-MIDDLE (single model)

Anonymous

86.71 94.55
29 2021.11.23 TBA

Anonymous

86.76 94.54
30 2022.10.15 LDCC-LM (single model)

Lotte Data Communication AI Technical Team

86.3 94.45
31 2021.04.10 InfoLab KorLM v0.1

KAIST InfoLab

83.99 94.45
32 2019.06.03 LaRva-Kor-Large + CLaF (single)

Clova AI LaRva Team (LPT)

86.79 94.37
33 2021.11.26 Aibril multilingual T5 - Large (single)

Aibril NLP AI team

87.02 94.37
34 2021.10.20 KonanNet v1.0 (single model)

Konan Technology Inc.

86.07 94.33
35 2020.01.02 SkERT-Large (single model)

Skelter Labs

86.30 94.28
36 2019.03.15 {BERT-CLKT} (single model)

Anonymous

86.22 94.08
37 2019.07.17 KorBERT

Anonymous

86.12 94.02
38 2019.05.07 LaRva-Kor+ + CLaF (single)

Clova AI LaRva Team (LPT)

85.35 93.96
39 2019.04.24 LaRva-Kor+ (single)

Clova AI LaRva Team (LPT)

85.25 93.94
40 2020.05.18 SDS-NET (single model)

Sanghwan Bae & Soonhwan Kwon

85.81 93.92
41 2020.03.24 ElBERT-v1.0 + MixTune + Data Augmentation (single)

Enliple AI Lab

86.17 93.84
42 2020.05.26 Opt (single model)

Anonymous

85.68 93.77
43 2021.08.25 NAMZ-ALBERT V2 (single)

Mediazen NAMZ AI Reseach Team and KISTI National Supercomputing Center

85.12 93.57
44 2019.07.25 Bert-Base-Kor-LEN (ensemble)

ChangWook Jun

85.51 93.46
45 2020.05.27 Baseline (single model)

Anonymous

84.97 93.38
46 2021.02.10 NAMZ-ALBERT (single)

Mediazen NAMZ AI Research Team and KISTI National Supercomputing Center

84.66 93.36
47 2020.10.23 Espresso (single)

SK Planet RB Dialogue Team(JunSeok Kim)

84.35 93.35
48 2021.04.21 Hansol-base-v1.1 (single model)

Hansol Inticube AI convergence LAB

84.38 93.22
49 2019.06.29 BERT-DAL-Masking-Morp (single)

JunSeok Kim

85.15 93.20
50 2020.07.08 ALBERT Large(single model)

Anonymous

84.12 93.07
51 2020.10.12 Cappuccino (single)

SK Planet RB Dialogue Team(JunSeok Kim)

83.48 93.00
52 2019.12.12 HanBert-54k-N (single model)

TwoBlock Ai

81.94 92.93
53 2019.09.20 ETRI BERT (single model)

deepfine

84.56 92.91
54 2019.05.24 BERT fine-tuned(ensemble)

Oh Yeon Taek

83.99 92.89
55 2021.01.17 ActionBasic (single model)

ActionPower

83.76 92.7
56 2019.12.19 HanBert-54k-ML (single model)

TwoBlock Ai

81.89 92.65
57 2019.06.19 ETRI BERT + Saltlux ADAM API (single model)

Saltlux Inc. AI Labs, AIR team

84.15 92.64
58 2019.04.10 BERT-Kor (single)

Clova AI LPT Team

83.79 92.63
59 2019.03.29 BERT insp. by GPT-2 + KHAIII (single)

Kakao NLP Team

84.12 92.62
60 2019.06.19 BERT-DA-Masking-Morph (single)

JunSeok Kim

84.20 92.59
61 2019.12.20 HanBert-90k-N (single model)

TwoBlock Ai

81.61 92.48
62 2019.12.20 HanBert-90k-ML (single model)

TwoBlock Ai

81.35 92.41
63 2019.09.10 ETRI BERT (single model)

deepfine

83.48 92.39
64 2019.04.01 BERT-Multilingual+CLAF+ReTK (single)

KIPI R&D; Center1

83.76 92.27
65 2019.01.30 BERT LM fine-tuned + KHAIII + DHA (single)

Kakao NLP Team

83.32 92.10
66 2019.12.04 BERT+VA (single)

JoonOh-Oh

83.68 92.00
67 2019.01.24 BERT LM fine-tuned (single) + KHAIII

Kakao NLP Team

82.14 91.85
68 2019.01.30 BERT multilingual (ensemble)

mypeacefulcode

82.53 91.67
69 2021.11.23 {T5-base} (single model)

RippleAI

81.68 91.65
70 2021.03.31 Hansol-Base-single-v1 (single)

Hansol Inticube AI convergence LAB

82.4 91.57
71 2019.03.28 BERT KOR (ensemble)

DeepNLP ONE Team

82.68 91.47
72 2019.06.13 {BERT-DA-Morph} (single)

JunSeok Kim

82.48 91.47
73 2019.06.03 DynamicConv + Self-Attention + N-gram masking (single)

Enliple AI and Chonbuk National University, Cognitive Computing Lab

80.94 91.45
74 2019.06.03 BERT_LM_fine-tuned (single)

Anonymous

82.04 91.40
75 2020.06.20 BERT+RNN (ensemble model)

🏆 Enliple AI NLP Challenge 🏆

KHY

SlideShare GitHub
82.22 91.39
76 2019.02.14 BERT fine-tuned (single)

GIST-Dongju Park

82.27 91.24
77 2019.03.21 BERT+KEFT (single)

KT BigData BU

82.27 91.23
78 2019.12.01 BERT (single)

JoonOh-Oh

81.68 91.12
79 2019.02.22 BERT/RPST (single)

Anonymous

82.25 91.11
80 2019.03.08 BERT + ES-Nori (single model)

Chang-Uk Jeong @ RNBSOFT AI Chatbot Team

81.94 91.04
81 2019.10.15 {BERT-base-unigramLM(Kudo)} (single model)

AIRI@domyounglee

78.55 91.04
82 2019.06.19 BERT-Kor-morph (single)

AIRI

80.09 91.01
83 2019.04.08 BERT (single)

Bnonymous

80.58 90.75
84 2020.06.20 BERT+RNN (single model)

🏆 Enliple AI NLP Challenge 🏆

KHY

SlideShare GitHub
81.40 90.74
85 2019.01.10 EBB-Net + BERT (single model)

Enliple AI

80.12 90.71
86 2019.04.10 Bert single-model

NerdFactory, AI research

81.63 90.68
87 2020.02.12 BERT-Multilingual (single model)

Anonymous

81.09 90.61
88 2019.09.05 {ETRI BERT} (single model)

deepfine

80.86 90.61
89 2020.06.25 BERT-Small + Transfer Learning + Adversarial Training (ensemble)

🏆 Enliple AI NLP Challenge 🏆

TmaxAI

SlideShare
81.73 90.55
90 2019.07.11 BERT-Fintent V1 + Utagger-UoU (single)

GDchain AI Lab

79.45 90.38
91 2019.03.13 BERT-Multilingual (single model)

Initiative

80.66 90.35
92 2019.05.08 BERT-Multiling-morph (single)

kwonmha

79.35 90.34
93 2019.03.05 BERT-multilingual (single model)

HYU-Minho Ryu

80.45 90.27
94 2019.09.18 Mobile-BERT(18M Params & 36.6MB size) (single)

Enliple AI and Chonbuk National University, Cognitive Computing Lab

81.07 90.25
95 2019.02.21 Bert_FineTuning (Single model)

Star Ji

71.75 90.12
96 2019.05.08 BERT-Multi-Kr (single)

paul.kim

71.86 89.83
97 2019.03.26 BERT (single model)

BDOT

71.78 89.82
98 2020.04.26 BERTbase (single model)

Anonymous

71.63 89.76
99 2019.06.17 BERT-Multilingual

lyeoni, NEOWIZ AI Lab

71.47 89.71
100 2020.02.17 {Bert_Multi} (multi model)

EunsongGoh

66.73 89.62
101 2020.06.25 BERT-Small + Transfer Learning + Adversarial Training (single model)

🏆 Enliple AI NLP Challenge 🏆

TmaxAI

SlideShare
80.40 89.59
102 2019.04.29 BERT_Multi (Single)

EunsongGoh

71.40 89.49
103 2019.01.11 BERT-Multiling-simple (single)

kwonmha

70.75 89.44
104 2019.02.19 BERT multilingual finetune TPU (single)

jskim_kbnow

71.19 89.20
105 2019.05.21 Bert-Base-Multilingual (Single)

ybigta KorQuAD

70.50 89.14
106 2020.06.19 scBert (single model)

🏆 Enliple AI NLP Challenge 🏆

PNU, delosycho@gmail.com

SlideShare GitHub
79.17 88.99
107 2020.06.19 scBert (single model)

🏆 Enliple AI NLP Challenge 🏆

PNU

SlideShare GitHub
79.27 88.98
108 2020.06.20 SDAC (single model)

🏆 Enliple AI NLP Challenge 🏆

[Enliple AI NLP Challenge] Team BS

SlideShare
78.71 88.95
109 2019.06.01 {BERT-Multilingual fine-tuned+OKT} (single)

JunSeok Kim

77.12 88.92
110 2020.06.20 scBert (single model)

🏆 Enliple AI NLP Challenge 🏆

PNU, Sangyeon, delosycho@gmail.com

SlideShare GitHub
78.86 88.90
111 2020.06.19 SDA (single model)

🏆 Enliple AI NLP Challenge 🏆

[Enliple AI NLP Challenge] Team BS

SlideShare
78.42 88.79
112 2020.06.19 BerT3Q ensemble T3Q-NLP

🏆 Enliple AI NLP Challenge 🏆

Team t3q.com [Enliple AI NLP Challenge]

SlideShare GitHub
78.99 88.65
113 2019.05.04 BERT-multilingual (single)

Anonymous

70.57 88.64
114 2020.06.19 5959 (single model)

🏆 Enliple AI NLP Challenge 🏆

GYKIM

Slides
78.63 88.58
115 2019.04.26 BERT-multilingual (single model)

Tae Hwan Jung@graykode, Kyung Hee Univ

69.86 88.49
116 2020.06.25 BERT-small-SeqBoost (single)

🏆 Enliple AI NLP Challenge 🏆

Yonsei Univ. | Korea Univ.

GitHub
78.27 88.29
117 2018.12.28 BERT-Multilingual (single)

Clova AI LPT Team

77.04 87.85
118 2020.06.18 predictions-200619(single model)

🏆 Enliple AI NLP Challenge 🏆

RnDeep

velog
76.94 87.50
119 2020.06.18 BERT-Dep (single)

🏆 Enliple AI NLP Challenge 🏆

Virssist

GitHub
77.30 87.45
120 2020.06.19 [AI NLP] bert small

🏆 Enliple AI NLP Challenge 🏆

76.68 87.43
121 2020.06.19 BERT-Dep2 (single)

🏆 Enliple AI NLP Challenge 🏆

Virssist

GitHub
76.99 87.33
122 2020.06.19 korquad_v1.0_0619

🏆 Enliple AI NLP Challenge 🏆

76.81 87.33
123 2020.06.19 [AI NLP] bert small

🏆 Enliple AI NLP Challenge 🏆

76.53 87.17
124 2020.06.19 BERT-small-SeqBoost (single)

🏆 Enliple AI NLP Challenge 🏆

Yonsei Univ. | Korea Univ.

GitHub
73.27 87.11
125 2019.03.04 DocQA (single)

CLaF

75.63 85.91
126 2019.12.20 DistilBERT-base-multilingual (default huggingface) (single model)

Heeryon Cho

66.88 85.72
127 2019.03.04 BiDAF (single)

CLaF

71.88 83.00
128 2019.12.19 DistilBERT-base-multilingual (from huggingface) (single model)

Anonymous

62.90 81.29
- 2018.10.17 Baseline 71.52 82.99