INDEX
Explanations
expressions likely related to emotional and conversational content, such as exclamations, wondering, gasping, sighing, replying, and asking questions
special characters or unusual symbols in the text
New Auto-Interp
Negative Logits
confir
-1.08
osponsors
-0.88
mercial
-0.88
ividual
-0.83
espie
-0.79
ilater
-0.75
targeted
-0.74
enegger
-0.74
latest
-0.73
commercially
-0.72
POSITIVE LOGITS
¹
1.12
ł
1.01
laugh
0.91
¶ħ
0.90
ij
0.87
¡
0.87
¤
0.85
£
0.84
Damn
0.84
ĵ
0.84
Activations Density 0.270%