Explanations
instances of the letter 'h' and variations in its usage within text
Negative Logits
831      -0.17
erez     -0.17
uchos    -0.16
idence   -0.15
бит      -0.15
ziej     -0.15
ило      -0.15
hani     -0.14
lei      -0.14
erne     -0.14
Positive Logits
undra    0.22
uv       0.20
ela      0.19
ustr     0.19
als      0.17
vide     0.17
ov       0.17
onom     0.16
oved     0.16
448      0.16
Activations Density 0.006%