INDEX
Explanations
phrases related to locations or concepts that are physically above or below something else
references to authority and societal norms
Negative Logits
sbm              -0.74
PLA              -0.70
Cosponsors       -0.69
alike            -0.68
ahead            -0.67
PsyNetMessage    -0.67
76561            -0.66
DAQ              -0.66
followed         -0.64
======           -0.62
Positive Logits
bounds           1.07
confines         1.02
boundaries       0.97
limits           0.96
horizon          0.93
borders          0.88
threshold        0.87
walls            0.82
veil             0.82
limitations      0.75
Activations Density: 0.166%