INDEX
Explanations
terms related to severe physical harm or negative impact
instances of the word "badly" and its variations indicating negative conditions or situations
Negative Logits
uality         -0.88
heny           -0.74
atu            -0.74
itivity        -0.74
Parenthood     -0.73
paternity      -0.71
Citation       -0.70
itures         -0.69
iture          -0.69
cript          -0.68
Positive Logits
beaten          0.96
wounded         0.93
lacking         0.93
mistaken        0.93
damaged         0.90
bruised         0.89
outnumbered     0.88
poisoned        0.87
needed          0.87
injured         0.87
Activations Density: 0.054%