Explanations
references to membership and belonging within groups or categories
Negative Logits
assa        -0.18
ulp         -0.15
roperties   -0.15
sav         -0.15
ukkit       -0.15
xes         -0.14
áp          -0.14
uur         -0.14
Prefs       -0.14
loth        -0.14
Positive Logits
czy         0.17
vr          0.16
ce          0.15
ASON        0.15
(blank)     0.15
ordon       0.14
odo         0.14
ison        0.14
ason        0.14
ISON        0.13
Activations Density 0.242%