INDEX
Explanations
instances of the letter "B" in various forms and contexts
Negative Logits
Gro       -0.16
gro       -0.16
gro       -0.15
ÑĢаниÑĨ    -0.15
ture      -0.14
490       -0.14
undi      -0.14
ianne     -0.14
514       -0.14
Aqu       -0.14
Positive Logits
weep      0.22
rawl      0.20
itch      0.20
legg      0.19
angers    0.19
loop      0.19
LOOP      0.18
leep      0.17
oppers    0.17
uble      0.17
Activations Density: 0.047%