INDEX
Explanations
phrases indicating someone expressing an opinion or belief
statements indicating personal or institutional positions and declarations
New Auto-Interp
Negative Logits
ctors
-0.81
OTAL
-0.63
ername
-0.62
Sabha
-0.62
quin
-0.62
JV
-0.61
Combine
-0.59
Commerce
-0.59
hoff
-0.58
EStreamFrame
-0.57
POSITIVE LOGITS
repeatedly
1.20
publicly
0.91
since
0.86
unequivocally
0.85
avering
0.83
reluctance
0.82
contingency
0.81
numerous
0.80
intentions
0.79
lately
0.78
Activations Density 0.193%