INDEX
Explanations
occurrences of the letter 'G' in various contexts
New Auto-Interp
Negative Logits
ym
-0.20
arden
-0.19
aza
-0.18
olf
-0.18
ender
-0.18
ен
-0.18
uj
-0.17
uard
-0.17
IVEN
-0.16
ucci
-0.15
POSITIVE LOGITS
ribbon
0.22
lick
0.20
audio
0.20
uti
0.19
inz
0.19
orf
0.18
sell
0.18
utsche
0.18
liga
0.18
ou
0.18
Activations Density 0.027%