INDEX
Explanations
numbers representing quantities or ranges
phrases that express quantities or estimates involving ranges
New Auto-Interp
Negative Logits
eval
-0.72
ires
-0.68
eless
-0.68
ocrat
-0.62
Cycling
-0.62
Haku
-0.62
ired
-0.62
Reporter
-0.61
ergic
-0.61
palp
-0.60
POSITIVE LOGITS
chard
1.18
acle
1.17
ifice
1.10
ific
1.08
acles
1.07
acular
1.05
nam
1.05
lando
1.02
chid
1.02
thodox
0.94
Activations Density 0.082%