INDEX
Explanations
phrases related to controversies or negative events
references to secrets or conspiracies involving influential figures or groups
New Auto-Interp
Negative Logits
retained
-0.72
outset
-0.69
imentary
-0.68
centrally
-0.68
eper
-0.68
supplemented
-0.64
isites
-0.63
erial
-0.63
strengthened
-0.62
memorandum
-0.62
POSITIVE LOGITS
âĢ
1.29
Elsa
1.05
anime
1.05
Naruto
1.04
Blizz
1.04
âľ
1.03
Pokemon
1.02
ponies
1.00
Twitch
1.00
Elsa
0.99
Activations Density 0.879%