INDEX
Explanations
names or terms related to specific locations and individuals
proper nouns, particularly names of places or people
New Auto-Interp
Negative Logits
lda
-0.78
Traff
-0.70
umerable
-0.64
;;;;;;;;;;;;
-0.64
Violence
-0.64
lder
-0.63
Rwanda
-0.63
ORGE
-0.63
çīĪ
-0.61
selage
-0.61
POSITIVE LOGITS
arine
1.06
idge
1.04
inet
0.99
aband
0.97
atche
0.94
sburgh
0.92
ards
0.92
arians
0.90
ees
0.89
arding
0.89
Activations Density 0.054%