INDEX
Explanations
mentions of pressure or urging for action
occurrences of the word "pressed" and its variations
Negative Logits
ĪĴ          -0.72
cknowled    -0.67
flame       -0.64
cliff       -0.63
Lights      -0.63
BIT         -0.63
verty       -0.62
bows        -0.60
¯¯¯¯        -0.60
fin         -0.58
Positive Logits
pressed     1.12
pressed     0.93
press       0.90
presses     0.87
entary      0.82
pressure    0.81
ioned       0.79
urized      0.77
roth        0.76
pressing    0.75
Activations Density 0.012%
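
As a rough illustration of the density figure above, the sketch below assumes that density means the fraction of sampled tokens on which the feature's activation exceeds a threshold. That definition, the function name `activation_density`, the threshold default, and the sample data are all assumptions for illustration and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumed definition: density = (tokens where the feature fires) / (all tokens).
    """
    values = list(activations)
    if not values:
        raise ValueError("activations must not be empty")
    active = sum(1 for a in values if a > threshold)
    return active / len(values)


# Hypothetical sample: 100,000 tokens, with the feature firing on 12 of them.
sample = [1.5] * 12 + [0.0] * (100_000 - 12)
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.012%"
```

Under this reading, a density of 0.012% corresponds to the feature firing on roughly 12 of every 100,000 tokens.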