INDEX
Explanations
statements indicating potential consequences or implications
references to costs associated with projects or actions
Negative Logits
rontal      -0.70
ukong       -0.69
kay         -0.67
acca        -0.66
ribune      -0.66
rill        -0.65
izza        -0.65
letes       -0.65
cision      -0.64
uminati     -0.64
Positive Logits
lacked         1.17
lacks          1.06
cautioned      1.04
hindered       1.00
overshadowed   0.96
differed       0.93
stalled        0.93
hampered       0.92
balk           0.92
fails          0.91
Activations Density: 0.749%