INDEX
Explanations
Twitter handles and online usernames
mentions of social media platforms or related handles
New Auto-Interp
Negative Logits
condensed
-0.65
bombard
-0.64
intimidation
-0.62
caring
-0.62
memos
-0.62
afore
-0.62
loosely
-0.61
condol
-0.61
commuting
-0.61
buildup
-0.61
POSITIVE LOGITS
Gh
1.24
Cu
1.22
OY
1.20
Ru
1.18
NK
1.17
RD
1.16
RN
1.16
qs
1.16
Tu
1.15
Ay
1.15
Activations Density 0.053%