INDEX
Explanations
numerical representations and their occurrences in the text
New Auto-Interp
Negative Logits
fst
-0.18
ady
-0.17
ald
-0.16
icles
-0.16
bilt
-0.16
stk
-0.16
stakes
-0.16
-0.15
ime
-0.15
enance
-0.15
POSITIVE LOGITS
Alive
0.22
éĥİ
0.20
rish
0.16
ertia
0.15
atio
0.15
errupted
0.15
uyen
0.15
éré
0.14
دا
0.14
th
0.14
Activations Density 0.062%