INDEX
Explanations
terms associated with governmental or organizational structures
Negative Logits
Token     Logit
ruba      -0.17
Bars      -0.16
Sesso     -0.15
rodin     -0.15
prung     -0.15
adele     -0.14
ENUM      -0.14
729       -0.14
ERA       -0.14
afia      -0.13
Positive Logits
Token     Logit
stone     0.14
ainless   0.14
zsche     0.14
passive   0.14
-blood    0.13
stay      0.13
ech       0.13
ince      0.13
chet      0.13
ing       0.13
Activations Density: 0.866%