INDEX
Explanations
instances of calls to action, specifically urging the reader to click for more information or to take specific steps
New Auto-Interp
Negative Logits
erring
-0.15
à¥Įà¤Ł
-0.15
hend
-0.14
autor
-0.14
URY
-0.14
gets
-0.14
uur
-0.14
uling
-0.14
_SEL
-0.14
ëĨĵ
-0.14
POSITIVE LOGITS
here
0.29
HERE
0.26
Here
0.21
ÙĩÙĨا
0.21
_here
0.20
aqui
0.20
below
0.19
aquÃŃ
0.19
è¿ĻéĩĮ
0.18
Here
0.18
Activations Density 0.014%