INDEX
Explanations
the indefinite article "an" in various contexts
New Auto-Interp
Negative Logits
Davies
-0.15
baÅŁÄ±na
-0.14
gil
-0.14
215
-0.14
nout
-0.13
ingroup
-0.13
tember
-0.13
Past
-0.13
inear
-0.13
gaard
-0.13
POSITIVE LOGITS
ther
0.31
archy
0.20
iling
0.19
ointed
0.19
ony
0.18
imes
0.18
/the
0.18
thers
0.17
agrams
0.16
ulled
0.16
Activations Density 0.244%