INDEX
Explanations
references to food items
the article "a" in various contexts
New Auto-Interp
Negative Logits
evidence
-0.92
agree
-0.68
reports
-0.67
Attempts
-0.66
enance
-0.66
impact
-0.65
Events
-0.64
Att
-0.64
aneously
-0.64
assessments
-0.63
POSITIVE LOGITS
bunch
1.28
lot
1.25
few
1.14
couple
1.10
handful
1.09
nice
1.01
little
0.97
glimpse
0.96
rouse
0.96
bit
0.95
Activations Density 1.091%