Explanations
instances where the writer emphasizes a high quantity or intensity of something
Negative Logits
ħĭ        -0.81
saf       -0.80
pherd     -0.76
selves    -0.75
othy      -0.75
Ń·        -0.74
²¾        -0.73
swer      -0.71
byn       -0.70
İĭ        -0.70
Positive Logits
emotion       0.81
firepower     0.81
hype          0.79
crap          0.75
shit          0.75
dmg           0.74
attention     0.74
fun           0.73
negativity    0.72
stuff         0.71
Activations Density 0.031%