INDEX
Explanations
references to national entities or national-level issues
New Auto-Interp
Negative Logits
ØŃداث
-0.17
anuts
-0.17
ä¸Ī
-0.16
431
-0.16
inz
-0.15
TINGS
-0.15
angen
-0.15
kova
-0.14
важ
-0.14
ëij
-0.14
POSITIVE LOGITS
/local
0.19
eres
0.19
/reg
0.17
ych
0.16
Tos
0.15
elow
0.15
okes
0.15
opes
0.14
ities
0.14
andum
0.14
Activations Density 0.028%