Explanations
descriptors related to violent actions and their consequences
Negative Logits
creep       -0.17
overnight   -0.17
beck        -0.16
knots       -0.14
Finger      -0.14
Overflow    -0.14
Battles     -0.14
handjob     -0.14
creeping    -0.14
ahn         -0.14
Positive Logits
impact      0.30
Impact      0.28
impact      0.27
Impact      0.25
impacts     0.24
landing     0.24
landing     0.21
impacting   0.21
Landing     0.20
rico        0.20
Activations Density: 0.380%