INDEX
Explanations
words and phrases related to collective identity and community structure
Negative Logits
izado      -0.16
izons      -0.15
isation    -0.15
itate      -0.15
izada      -0.14
izados     -0.14
iore       -0.14
ovu        -0.14
reau       -0.14
izr        -0.14
Positive Logits
ige        0.39
liche      0.35
lich       0.35
ig         0.32
lichen     0.32
licher     0.30
iger       0.29
elijk      0.28
igen       0.28
elijke     0.27
Activations Density 0.069%