INDEX
Explanations
Japanese to English proficiency
New Auto-Interp
Negative Logits
ot
1.23
os
1.22
ai
1.13
et
1.12
us
1.05
ER
1.05
um
0.99
be
0.98
৫
0.98
been
0.95
POSITIVE LOGITS
ان
1.08
یل
1.05
ين
1.02
ون
1.02
ية
1.01
ə
0.99
ки
0.96
した
0.93
estrict
0.93
پ
0.92
Activations Density 1.501%