INDEX
Explanations
mentions of the social media platform "Twitter"
mentions of Twitter and its related context
Negative Logits
Token              Logit
Magikarp           -0.77
bard               -0.74
senal              -0.71
igated             -0.65
DEN                -0.65
Karin              -0.64
blush              -0.63
++++++++++++++++   -0.62
tenance            -0.60
igating            -0.60
Positive Logits
Token              Logit
yk                 1.11
ares               0.96
elfth              0.96
orks               0.95
ipes               0.91
icket              0.86
urst               0.85
oria               0.85
orable             0.84
ör                 0.84
Activations Density: 0.015%