Explanations
words related to negative impacts or harms, particularly with a strong negative effect
references to negative impacts or setbacks
Negative Logits
Token       Logit
iosity      -0.79
guiIcon     -0.72
cius        -0.67
Definition  -0.67
rian        -0.66
orkshire    -0.66
HCR         -0.65
afort       -0.65
heid        -0.65
pora        -0.64
Positive Logits
Token       Logit
blow        0.90
outs        0.88
blow        0.88
blows       0.84
retard      0.82
pipe        0.81
hole        0.81
out         0.81
bang        0.81
Blow        0.79
Activations Density: 0.012%