INDEX
Explanations
hyperlinks or calls to action in a text
occurrences of the word "HERE" in various contexts
Negative Logits
gling    -0.80
bath     -0.69
egu      -0.68
utter    -0.67
omore    -0.67
ible     -0.65
urger    -0.64
usher    -0.64
eff      -0.63
arma     -0.63
Positive Logits
HERE     1.12
NOW      0.90
THERE    0.89
BELOW    0.88
FORE     0.85
WHERE    0.85
INCLUD   0.81
NEXT     0.81
PHOTO    0.81
tical    0.80
Activations Density 0.010%