INDEX
Explanations
mentions of specific physical attributes or characteristics
instances of the article 'a' or 'an' preceding nouns
New Auto-Interp
Negative Logits
Events
-0.89
Own
-0.86
NP
-0.84
orders
-0.82
Examples
-0.80
names
-0.79
organizations
-0.78
tests
-0.78
encies
-0.77
events
-0.77
POSITIVE LOGITS
bunch
1.14
piece
1.03
pair
1.02
translucent
1.00
rectangle
1.00
towel
1.00
handful
0.99
layer
0.98
wooden
0.98
rectangular
0.97
Activations Density 0.253%