INDEX
Explanations
articles ('a', 'an', 'the') followed by single nouns
instances of the indefinite article "a" or "an" and their contextual associations
Negative Logits
Sources    -0.76
Jagu       -0.74
Links      -0.73
inson      -0.70
odore      -0.69
AUD        -0.65
aci        -0.65
aliases    -0.64
killers    -0.64
Amend      -0.64
Positive Logits
particular  1.17
bunch       1.09
person      1.08
certain     1.02
lot         1.01
uras        0.99
rouse       0.96
stranger    0.92
single      0.90
piece       0.90
Activations Density 0.306%