INDEX
Explanations
phrases related to criticism, controversy, or negative events
Negative Logits
ortality        -0.63
cellaneous      -0.58
aceae           -0.58
oneself         -0.57
srfAttach       -0.57
Alternatively   -0.57
OTAL            -0.56
depended        -0.54
myself          -0.54
Died            -0.54
Positive Logits
bullying        0.75
crackdown       0.74
antics          0.71
mishand         0.71
allegations     0.68
transgender     0.68
encro           0.68
perceived       0.68
accusations     0.68
scandal         0.67
Activations Density: 0.948%