INDEX
Explanations
words related to intense emotions, particularly anger
instances of the word "rage" along with related emotional expressions
New Auto-Interp
Negative Logits
icut
-0.85
ramer
-0.83
herty
-0.79
rica
-0.71
coerc
-0.71
lder
-0.68
nai
-0.68
pse
-0.67
arent
-0.67
Liberties
-0.66
POSITIVE LOGITS
quit
1.06
rage
1.00
fury
0.89
raging
0.81
furnace
0.80
ï¸
0.78
bol
0.77
TEXTURE
0.76
ously
0.72
vengeance
0.71
Activations Density 0.019%