INDEX
Explanations
names and terms related to political figures and organizations
instances of the letter combinations indicative of linguistic variations or patterns
New Auto-Interp
Negative Logits
Mehran
-0.83
hovah
-0.74
uni
-0.71
doi
-0.71
Nare
-0.62
reckoning
-0.61
hler
-0.61
utra
-0.60
numbered
-0.60
éŃĶ
-0.58
POSITIVE LOGITS
enment
0.89
tarian
0.86
ateral
0.81
ibly
0.81
ional
0.78
ãĥ¼ãĥĨ
0.77
ables
0.75
ATION
0.72
Stevenson
0.70
itely
0.67
Activations Density 0.235%