INDEX
Explanations
phrases expressing various degrees of evaluation or judgment, particularly focusing on the phrase "at best"
phrases indicating varying degrees of quality or effectiveness
Negative Logits
selves       -0.68
condition    -0.61
ISH          -0.61
forth        -0.61
illance      -0.60
lish         -0.60
FTWARE       -0.59
oku          -0.58
HAM          -0.58
endum        -0.57
Positive Logits
times        1.25
best         1.02
least        0.89
heart        0.88
worst        0.88
mosp         0.83
first        0.82
olls         0.80
onal         0.79
odds         0.74
Activations Density 0.086%
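
The page does not define how the density figure is computed. As a rough sketch, assuming it is the fraction of sampled token positions on which the feature's activation is above zero, 0.086% would correspond to about 86 active positions per 100,000 tokens. The function name, the threshold, and the sample data below are hypothetical and only illustrate that arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all; the source page does not state its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 86 of which are active.
sample = [1.0] * 86 + [0.0] * (100_000 - 86)
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.086%"
```

A feature this sparse fires on fewer than one token in a thousand, which fits a feature tied to a narrow set of phrases such as "at best", "at least", and "at times".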