INDEX
Explanations
words related to disapproval or criticism
words related to negative judgment or condemnation
New Auto-Interp
Negative Logits
emade
-0.76
ahime
-0.75
unin
-0.75
ãĥ¯
-0.74
iasm
-0.74
glers
-0.73
inn
-0.71
ãĥ³ãĤ¸
-0.71
heed
-0.71
ão
-0.70
POSITIVE LOGITS
Sparkle
0.65
Parents
0.64
payable
0.63
osponsors
0.62
INGTON
0.62
ately
0.61
vier
0.60
Rising
0.60
reven
0.60
ORED
0.60
Activations Density 0.042%