INDEX
Explanations
mentions of the English language
occurrences of the word "English" in various contexts
Negative Logits
enges         -0.79
psy           -0.78
uder          -0.75
atl           -0.73
ĸļ            -0.73
Ara           -0.72
stals         -0.71
cients        -0.71
Downloadha    -0.70
ertodd        -0.70
Positive Logits
translation     1.06
translations    1.00
language        0.98
speaking        0.96
muff            0.96
subtitles       0.96
shire           0.91
man             0.90
Language        0.90
language        0.88
Activations Density: 0.024%