INDEX
Explanations
terms related to social hierarchies or levels
references to social class or hierarchical positions within society
Negative Logits
Machina       -0.90
tnc           -0.83
vous          -0.77
printf        -0.72
Wikipedia     -0.71
arium         -0.70
Canaver       -0.69
DVD           -0.68
Newsweek      -0.67
uality        -0.66
Positive Logits
most          0.97
ĺħ            0.82
upper         0.80
earners       0.80
floors        0.79
bridge        0.79
shaft         0.78
thigh         0.77
obser         0.77
denomination  0.76
Activations Density: 0.007%