Explanations
instances of the word "River" or its variations in the text
Negative Logits
Token    Logit
vla      -0.09
upe      -0.08
ignum    -0.08
宅       -0.08
jvu      -0.07
antry    -0.07
eyle     -0.07
ラク     -0.07
rupa     -0.07
erais    -0.07
Positive Logits
Token         Logit
front         0.08
ine           0.07
dale          0.07
compromise    0.07
ks            0.06
Herman        0.06
fy            0.06
point         0.06
lining        0.06
breeze        0.05
Activations Density: 0.012%