Explanations
actions involving physical force or effort, such as pushing, pulling, grabbing, and tugging
actions involving physical force or manipulation
Negative Logits
Surviv       -0.77
mun          -0.74
ドラゴン     -0.73
Broadcast    -0.70
Deal         -0.70
ノ           -0.69
league       -0.69
天           -0.68
mberg        -0.67
Fallen       -0.67
Positive Logits
unconscious  0.84
joints       0.79
torches      0.78
glide        0.78
lasses       0.77
jerk         0.76
limp         0.74
downwards    0.74
gently       0.73
fists        0.73
Activations Density 0.141%