INDEX
Explanations
foreign characters related to specific languages
New Auto-Interp
Negative Logits
rake
-0.74
Downloadha
-0.66
ilater
-0.65
disenfranch
-0.63
Derby
-0.62
Sussex
-0.62
chau
-0.62
DRAG
-0.61
bda
-0.61
birthplace
-0.61
POSITIVE LOGITS
ħ
1.17
Į
1.05
к
1.04
ÑĤ
0.96
İ
0.94
Ð
0.93
obar
0.92
Û
0.91
ĭ
0.89
à¨
0.89
Activations Density 0.009%