INDEX
Explanations
phrases indicating uncertainty or possibility
phrases that express potentiality or conditionality
New Auto-Interp
Negative Logits
eye
-0.70
resso
-0.69
Bene
-0.65
athing
-0.64
Dear
-0.64
ament
-0.63
raint
-0.62
bern
-0.62
Columb
-0.62
arthed
-0.61
POSITIVE LOGITS
require
1.02
escalate
0.99
delay
0.99
vary
0.98
postpone
0.98
shorten
0.97
exacerbate
0.96
complicate
0.94
modify
0.94
involve
0.93
Activations Density 0.188%