INDEX
Explanations
words related to strong negative sentiments or emotions, particularly hatred
expressions of dislike or hatred toward various subjects
New Auto-Interp
Negative Logits
DragonMagazine
-1.08
igmatic
-0.93
ItemImage
-0.89
OGR
-0.83
aunder
-0.82
enture
-0.78
arov
-0.78
eva
-0.77
akeru
-0.76
aqu
-0.75
POSITIVE LOGITS
fully
1.03
hated
0.96
wasting
0.88
hate
0.86
Mondays
0.85
lessly
0.83
hate
0.82
hates
0.77
dearly
0.76
bullies
0.72
Activations Density 0.061%