INDEX
Explanations
phrases expressing opposition or caution
phrases related to opposition and calls to action against various issues
New Auto-Interp
Negative Logits
calmed
-0.78
ortality
-0.76
ector
-0.75
albeit
-0.74
eline
-0.73
oglobin
-0.70
hesion
-0.70
si
-0.70
irement
-0.70
mma
-0.69
POSITIVE LOGITS
unnecessary
1.16
undue
1.14
wasteful
1.13
excessive
1.13
wasting
1.07
needless
1.07
duplication
1.03
discriminatory
1.02
improper
1.02
discrimination
1.01
Activations Density 0.272%