INDEX
Explanations
phrases indicating future actions or intentions
instances of the phrase "I’m going to" indicating intentions or future plans
New Auto-Interp
Negative Logits
essa
-0.58
Flavoring
-0.58
ullah
-0.58
ament
-0.56
alys
-0.56
archives
-0.55
clips
-0.55
hold
-0.54
urus
-0.53
holding
-0.53
POSITIVE LOGITS
to
1.06
overboard
0.83
nowhere
0.80
crazy
0.80
nuts
0.75
insane
0.73
ta
0.72
extinct
0.71
mad
0.69
HAM
0.67
Activations Density 0.051%