INDEX
Explanations
phrases related to physical grasping or control
references to the concept of "grip" or maintaining control
Negative Logits
mercial     -0.68
ffe         -0.67
leased      -0.66
issy        -0.63
oha         -0.62
une         -0.62
ownt        -0.62
orthy       -0.62
abad        -0.61
puting      -0.59
Positive Logits
grip        1.49
Grip        1.32
grips       1.23
gripped     0.85
gripping    0.81
lapt        0.76
recoil      0.76
Attach      0.73
hold        0.73
FontSize    0.70
Activations Density: 0.010%