INDEX
Explanations
occurrences of the letter 'b' in various contexts
New Auto-Interp
Negative Logits
uyết
-0.16
illard
-0.15
ffect
-0.14
usterity
-0.14
ÑĢайонÑĥ
-0.14
skoro
-0.14
kola
-0.13
/type
-0.13
utow
-0.13
arbon
-0.13
POSITIVE LOGITS
imed
0.17
INES
0.16
ted
0.16
iven
0.16
inen
0.15
ok
0.14
framework
0.14
itta
0.14
ined
0.14
ast
0.14
Activations Density 0.037%