INDEX
Explanations
actions related to physically grabbing or holding something
the action of taking hold of something
Negative Logits
xual        -0.68
withd       -0.67
Discuss     -0.64
Correction  -0.64
REDACTED    -0.63
ingen       -0.63
guy         -0.62
Correct     -0.62
conduc      -0.62
die         -0.61
Positive Logits
bers        0.90
onto        0.86
grab        0.85
grabbing    0.83
bable       0.82
rill        0.81
bing        0.80
iques       0.78
            0.74
bys         0.73
Activations Density 0.014%