INDEX
Explanations
phrases related to forceful or impactful actions, such as "blow," "bust," "kick," and "smash."
actions related to blowing or bursting something
New Auto-Interp
Negative Logits
immer
-0.75
States
-0.74
Impl
-0.69
Solution
-0.67
interrupted
-0.65
lihood
-0.65
anse
-0.64
saf
-0.61
fragment
-0.61
iaz
-0.60
POSITIVE LOGITS
burgers
0.90
elbows
0.86
shit
0.86
noses
0.84
skulls
0.83
horns
0.82
heads
0.81
popcorn
0.81
necks
0.80
asses
0.79
Activations Density 0.263%