INDEX
Explanations
occurrences of the article "a" and variations thereof, indicating a focus on indefinite articles in context
New Auto-Interp
Negative Logits
bble
-0.18
vale
-0.17
cts
-0.16
Mask
-0.15
nict
-0.15
rs
-0.14
bon
-0.14
hen
-0.14
al
-0.14
078
-0.14
POSITIVE LOGITS
causa
0.22
seg
0.19
liv
0.19
eree
0.19
posterior
0.19
mpi
0.18
ereo
0.18
ffer
0.18
fian
0.17
caval
0.17
Activations Density 0.002%