INDEX
Explanations
instances of the word "ho" and variations of its capitalization
New Auto-Interp
Negative Logits
bers
-0.17
mente
-0.16
ον
-0.16
borg
-0.15
vers
-0.14
kes
-0.14
cede
-0.14
ners
-0.14
phinx
-0.14
bur
-0.14
POSITIVE LOGITS
resh
0.20
ho
0.19
izontal
0.18
ặc
0.18
isting
0.18
arding
0.17
Ho
0.17
arded
0.17
iteli
0.17
oven
0.16
Activations Density 0.008%