INDEX
Explanations
expressions of frustration or strong emotions
New Auto-Interp
Negative Logits
policymakers
-0.74
ielding
-0.68
quartered
-0.66
asury
-0.64
markedly
-0.63
strikingly
-0.62
outset
-0.62
broadly
-0.61
incumbent
-0.61
Footnote
-0.61
POSITIVE LOGITS
;)
1.34
fuckin
1.32
haha
1.24
kinda
1.21
lol
1.20
:)
1.18
shit
1.18
:(
1.17
bitch
1.15
!!!!!
1.14
Activations Density 10.318%