INDEX
Explanations
phrases referring to a small quantity or number
occurrences of the word "handful."
Negative Logits
Train       -0.72
mediate     -0.66
hemat       -0.66
ACTED       -0.62
causation   -0.62
abolic      -0.62
fet         -0.62
Adv         -0.61
structed    -0.61
poses       -0.61

Positive Logits
een          1.01
eenth        0.98
dozen        0.96
dozen        0.90
thousand     0.82
ILCS         0.81
uously       0.79
hundred      0.78
handful      0.71
tery         0.71
Activations Density 0.012%
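
To put the density figure in concrete terms: assuming it is the share of tokens on which the feature activates at all (an assumption here; the page does not define it), 0.012% corresponds to roughly 12 active tokens per 100,000. A minimal sketch of that arithmetic follows. The function name `activation_density` and the sample activations are hypothetical and not taken from the dashboard.

```python
# Hypothetical sketch: activation density as the share of tokens on which
# a feature fires. The activation values below are made up for illustration;
# they are not taken from the dashboard.

def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly greater than threshold."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)

# 100,000 tokens, 12 of which activate the feature (illustrative numbers).
sample = [0.0] * 99_988 + [1.5] * 12
density = activation_density(sample)
print(f"{density:.3%}")
```

A feature this sparse fires on very few tokens, which fits explanations that point to a narrow set of quantity words such as "dozen", "hundred", and "handful".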