INDEX
Explanations
differences or alternatives between two choices or options
expressions of opposition or dissent
Negative Logits
negie       -0.93
oufl        -0.77
hack        -0.75
ammy        -0.74
ramid       -0.74
opia        -0.73
argo        -0.73
raz         -0.71
overed      -0.71
amaz        -0.70
Positive Logits
opposed      0.94
vehemently   0.86
thereto      0.84
voc          0.82
aback        0.82
minded       0.73
disposed     0.72
onent        0.72
cens         0.72
foes         0.69
Activations Density: 0.013%