INDEX
Explanations
verbs that indicate support or endorsement of actions, initiatives, or policies
phrases related to support or endorsement
New Auto-Interp
Negative Logits
cabinet
-0.67
hatt
-0.67
fty
-0.64
hern
-0.62
Zan
-0.62
sbm
-0.62
reluct
-0.62
reau
-0.62
adder
-0.61
iece
-0.60
POSITIVE LOGITS
GMOs
0.67
auna
0.65
PHI
0.65
Brach
0.64
LW
0.62
izes
0.61
inflammation
0.60
ides
0.60
é¾įå¥ij士
0.58
copyrighted
0.58
Activations Density 0.251%