INDEX
Explanations
instances of the letters "gr"
New Auto-Interp
Negative Logits
an
-0.25
at
-0.24
in
-0.21
anou
-0.20
im
-0.19
id
-0.17
inis
-0.17
anlar
-0.17
and
-0.16
is
-0.16
POSITIVE LOGITS
gr
0.20
ating
0.18
inders
0.17
vine
0.16
wort
0.16
ayer
0.16
iddle
0.16
amma
0.15
INVAL
0.15
indi
0.15
Activations Density 0.007%