INDEX
Explanations
mentions of individuals and their actions or characteristics
elements related to sports and notable achievements
New Auto-Interp
Negative Logits
goddamn
-0.69
itized
-0.65
Fuck
-0.62
]]
-0.60
hetic
-0.60
...)
-0.59
Mods
-0.58
.[
-0.58
Quantity
-0.58
â̦)
-0.58
POSITIVE LOGITS
criticised
0.89
controvers
0.78
angered
0.72
ousted
0.72
spokesman
0.72
sacked
0.70
embroiled
0.68
outgoing
0.67
hailed
0.66
controversial
0.66
Activations Density 1.475%