INDEX
Explanations
occurrences of the letter 'A' in the text
New Auto-Interp
Negative Logits
èĻķ
-0.16
uld
-0.16
linger
-0.15
ufs
-0.15
esters
-0.15
/react
-0.14
ucus
-0.14
ubl
-0.14
rout
-0.14
alsy
-0.14
POSITIVE LOGITS
ay
0.20
nees
0.20
ham
0.20
ish
0.19
am
0.19
angan
0.19
ap
0.19
apk
0.19
je
0.19
anch
0.19
Activations Density 0.060%