INDEX
Explanations
modal verbs indicating possibility or permission
modal verbs indicating possibility or probability
New Auto-Interp
Negative Logits
Crunch
-0.70
jri
-0.64
ament
-0.64
Glob
-0.63
reigning
-0.63
ATS
-0.62
essors
-0.61
Revision
-0.60
ILA
-0.59
executions
-0.58
POSITIVE LOGITS
prefer
1.15
choose
1.10
decide
1.08
want
1.06
opt
1.06
mistakenly
1.02
hesitate
0.98
perceive
0.97
inadvertently
0.95
argue
0.95
Activations Density 0.143%