INDEX
Explanations
expressions of emotional distress or frustration
New Auto-Interp
Negative Logits
liceerd
-0.45
resourceCulture
-0.41
Miß
-0.39
fitriones
-0.38
TestBed
-0.38
conjoint
-0.37
Groetjes
-0.37
XmlAccessType
-0.37
什么的
-0.37
(!)
-0.36
POSITIVE LOGITS
lmao
0.89
bruh
0.84
meme
0.82
cuck
0.81
fucking
0.79
lmfao
0.79
cringe
0.79
ngl
0.78
memes
0.76
Bruh
0.75
Activations Density 0.378%