INDEX
Explanations
punctuation marks and their frequency in the text
New Auto-Interp
Negative Logits
ith
-0.18
age
-0.16
anc
-0.16
Pearl
-0.15
itt
-0.15
ched
-0.15
Ãłu
-0.14
muss
-0.14
met
-0.14
idd
-0.14
POSITIVE LOGITS
ryo
0.15
à¸Ńล
0.14
pNet
0.14
ä¸ģ缮
0.14
LEAN
0.14
Ñĥд
0.14
erto
0.14
pac
0.14
.Restr
0.14
piler
0.14
Activations Density 0.015%