INDEX
Explanations
names of political figures
highly active verbs and phrases related to gestures and actions in political or social contexts
Negative Logits
Token          Logit
pled           -0.73
sooner         -0.69
netflix        -0.68
equivalents    -0.66
accordingly    -0.64
extrap         -0.64
ourselves      -0.64
haps           -0.63
psycho         -0.61
DV             -0.61
Positive Logits
Token          Logit
WATCHED        1.05
window         1.02
allery         0.82
OTOS           0.82
hello          0.80
VIDEOS         0.76
ï              0.75
UTERS          0.74
PHOTO          0.74
photo          0.74
Activations Density: 0.550%
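
The density figure above can be read as the share of tokens on which the feature fires at all. The following is a minimal sketch, assuming density means the fraction of sampled tokens with a non-zero activation (the page itself does not state the definition). The function name `activation_density` and the sample data are hypothetical and chosen only to illustrate the arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is taken to mean the share of tokens on which
    the feature is active; this definition is not given in the source.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 11 active tokens out of 2000.
sample = [0.0] * 1989 + [1.3] * 11
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.550%
```

Under this reading, a density of 0.550% corresponds to roughly 55 active tokens in every 10,000 sampled.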