INDEX
Explanations
references to strongly encouraging or advising action
expressions of urging or encouraging action
New Auto-Interp
Negative Logits
icators
-0.82
Surv
-0.78
icator
-0.75
missions
-0.70
Half
-0.68
ammy
-0.67
poon
-0.64
integ
-0.63
ded
-0.63
este
-0.63
POSITIVE LOGITS
urge
1.36
urges
1.07
incent
0.99
ĸļ
0.81
urging
0.78
ingly
0.77
reminding
0.75
urged
0.72
compel
0.71
tempted
0.71
Activations Density 0.006%