INDEX
Explanations
negative or critical words and phrases
variations of the word "play."
New Auto-Interp
Negative Logits
âĸ¬
-0.77
HAHA
-0.73
âĸ¬âĸ¬
-0.73
Logged
-0.68
////////////////////////////////
-0.67
ä¸Ģ
-0.65
fav
-0.63
spin
-0.63
ĪĴ
-0.63
QUI
-0.63
POSITIVE LOGITS
acement
1.25
asma
1.21
atinum
1.17
enty
1.15
umbing
1.05
acements
1.02
icably
1.01
ains
1.00
atoon
0.98
atter
0.98
Activations Density 0.014%