INDEX
Explanations
general terms that are commonly associated with various types of groups or situations
references to general group characteristics or commonalities
New Auto-Interp
Negative Logits
NI
-0.71
ornia
-0.71
Pad
-0.70
kefeller
-0.69
anooga
-0.67
ARE
-0.67
dor
-0.66
bley
-0.66
thouse
-0.66
ħĭ
-0.65
POSITIVE LOGITS
others
0.86
other
0.83
else
0.74
nascent
0.73
great
0.72
revolutions
0.70
good
0.70
mammals
0.68
totalitarian
0.67
civilized
0.67
Activations Density 0.095%