INDEX
Explanations
phrases related to capabilities or possibilities
modal verbs indicating capability or possibility
New Auto-Interp
Negative Logits
rill
-0.67
zel
-0.67
DRAG
-0.67
zin
-0.64
ult
-0.61
issions
-0.61
well
-0.60
oglu
-0.59
especially
-0.59
2020
-0.58
POSITIVE LOGITS
anymore
1.02
nor
0.85
nor
0.84
OTAL
0.72
Eag
0.70
semblance
0.68
ever
0.68
ever
0.67
spor
0.65
meas
0.65
Activations Density 0.333%