INDEX
Explanations
phrases describing physical actions involving objects
occurrences of the article "a"
New Auto-Interp
Negative Logits
Own
-0.83
Sources
-0.83
Rules
-0.83
Events
-0.79
Discuss
-0.77
tests
-0.76
evidence
-0.74
Iss
-0.74
Edit
-0.73
States
-0.73
POSITIVE LOGITS
handful
1.14
bunch
1.06
rouse
0.99
broom
0.98
breeze
0.98
knife
0.96
flurry
0.95
tray
0.94
few
0.93
multitude
0.93
Activations Density 0.472%