INDEX
Explanations
instances of the word "walk" and its variations within the context
New Auto-Interp
Negative Logits
hw
-0.16
frei
-0.15
eve
-0.15
Downs
-0.15
ooke
-0.14
nici
-0.14
lef
-0.14
565
-0.14
aeper
-0.14
amus
-0.13
POSITIVE LOGITS
arella
0.19
chedulers
0.15
ody
0.14
0.14
erman
0.14
leÅŁik
0.14
jišť
0.14
adel
0.14
Jenn
0.14
ีà¸ŀ
0.14
Activations Density 0.049%