INDEX
Explanations
adjectives related to negative impact, such as 'hurt'
instances of the word "hurt" in various contexts
New Auto-Interp
Negative Logits
DragonMagazine
-0.69
vironment
-0.68
iasco
-0.65
au
-0.64
uties
-0.64
clerosis
-0.64
guyen
-0.63
aut
-0.63
aer
-0.62
pedigree
-0.62
POSITIVE LOGITS
ful
0.98
hurt
0.91
onies
0.87
hurts
0.81
headed
0.80
hurting
0.78
igue
0.78
feelings
0.78
fully
0.76
lehem
0.75
Activations Density 0.011%