INDEX
Explanations
occurrences of the word "Hill" and its variations in various contexts
Negative Logits
yne      -0.16
kla      -0.16
vais     -0.16
sak      -0.15
ecz      -0.15
s        -0.15
ooke     -0.14
icha     -0.14
samp     -0.14
avian    -0.14
Positive Logits
side     0.48
iard     0.43
top      0.38
crest    0.31
arious   0.30
iards    0.25
ides     0.24
ier      0.24
ary      0.23
bill     0.23
Activations Density 0.012%