INDEX
Explanations
content related to social media posts and interactions
informal communication
New Auto-Interp
Negative Logits
©¶æ¥µ
-0.88
âĢij
-0.86
,—
-0.74
glim
-0.70
FactoryReloaded
-0.69
ersen
-0.68
—"
-0.65
Footnote
-0.64
¶
-0.64
—
-0.63
POSITIVE LOGITS
@
1.40
pics
1.12
#
1.11
congr
1.08
sic
1.07
ðŁĺ
1.05
ðŁij
1.03
DonaldTrump
1.02
1.01
ðŁ
1.00
Activations Density 0.546%