INDEX
Explanations
references to specific locations or settings within social situations
references to people's presence and roles in specific contexts
Negative Logits
Token        Logit
partName     -0.72
elman        -0.71
DERR         -0.66
subp         -0.66
scroll       -0.66
Browser      -0.63
Gleaming     -0.62
asel         -0.62
aler         -0.60
reverted     -0.60
Positive Logits
Token        Logit
DonaldTrump  0.72
roofs        0.70
rooft        0.68
orbit        0.67
town         0.66
equation     0.65
whom         0.64
veyard       0.63
fold         0.63
anytime      0.62
Activations Density: 0.376%
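
As a rough illustration of how a density figure like this one could be computed, the sketch below assumes that activation density means the fraction of sampled token positions on which the feature's activation is above zero. The page does not state that definition. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the page or from any particular library.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly greater than threshold.

    Assumption: "density" is the share of token positions where the
    feature fires at all; the source page does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 376 of them active.
sample = [1.5] * 376 + [0.0] * 99_624
density = activation_density(sample)
print(f"{density:.3%}")
```

With this made-up sample, 376 active positions out of 100,000 give a density matching the figure reported above. The real sample size behind the page's number is not given.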