INDEX
Explanations
phrases indicating a negative impact or worsening of a situation
phrases that indicate worsening situations or problems
Negative Logits
Token       Logit
orsi        -0.71
icipated    -0.69
ELD         -0.67
avage       -0.67
nered       -0.66
atti        -0.65
sha         -0.64
owes        -0.63
faced       -0.63
airo        -0.62
Positive Logits
Token       Logit
easier      1.31
clearer     1.19
happen      1.14
simpler     1.13
harder      1.10
worse       1.08
smoother    1.05
worthwhile  1.01
faire       0.98
safer       0.98
Activations Density: 0.076%