INDEX
Explanations
specific countries and their presence in the text
New Auto-Interp
Negative Logits
ingly
-0.18
oga
-0.15
arguably
-0.14
mens
-0.14
Easter
-0.14
Export
-0.14
αγ
-0.14
£¨
-0.14
ãģ£ãģ
-0.13
ret
-0.13
POSITIVE LOGITS
adle
0.16
-flag
0.15
jvu
0.15
Nüfus
0.15
ToLocal
0.15
uder
0.15
fine
0.14
flags
0.14
SizeMode
0.14
abis
0.14
Activations Density 0.060%