INDEX
Explanations
verbs related to analyzing, speculating, or evaluating outcomes
phrases indicating conflict, change, and critical evaluation of situations
New Auto-Interp
Negative Logits
robe
-0.69
ufact
-0.66
bill
-0.65
ench
-0.63
artney
-0.61
amina
-0.61
nik
-0.58
Bride
-0.58
apon
-0.57
ouver
-0.57
POSITIVE LOGITS
causation
0.72
Corker
0.67
abound
0.67
omission
0.65
ãĥ¼ãĥĨ
0.64
unanswered
0.63
Rosenstein
0.62
Background
0.62
bias
0.61
nonetheless
0.60
Activations Density 2.364%