INDEX
Explanations
words related to intentional harm or deceitful actions
instances of the word "screw" and its variations
Negative Logits
Token           Logit
åī              -0.79
usable          -0.74
CTV             -0.71
vation          -0.69
CVE             -0.65
outh            -0.65
Interstitial    -0.64
obook           -0.64
sovere          -0.63
icipated        -0.63
Positive Logits
Token           Logit
driver          1.31
drivers         1.11
hole            0.94
Whedon          0.94
screws          0.91
holes           0.88
ball            0.85
screw           0.83
balls           0.82
nuts            0.76
Activations Density 0.014%