INDEX
Explanations
terms associated with conflict and societal issues
preceding verbs or nouns
causes negative outcomes
Negative Logits
HtmlAttribute       -0.65
Viited              -0.53
sumpay              -0.49
macam               -0.47
mitting             -0.45
without             -0.44
lives               -0.43
сылкі               -0.43
DispatchToProps     -0.42
quelles             -0.42
Positive Logits
ensures             1.08
helps               1.02
gives               0.92
brings              0.90
makes               0.90
ensure              0.88
enables             0.88
helped              0.88
sprawia             0.88
ทำให้               0.88
Activations Density: 0.577%