INDEX
Explanations
sensitive issues or controversy in a text
indicators of significant issues or warnings
New Auto-Interp
Negative Logits
etheless
-0.87
hement
-0.83
unprotected
-0.81
scrap
-0.71
endeav
-0.71
indu
-0.70
succumb
-0.69
streng
-0.68
afloat
-0.67
conscious
-0.67
POSITIVE LOGITS
Newsletter
0.97
Loading
0.97
Wilson
0.96
Said
0.95
Official
0.94
Posted
0.93
SPONSORED
0.90
Comment
0.90
Anonymous
0.88
Trivia
0.87
Activations Density 0.197%