INDEX
Explanations
punctuation marks and their patterns in sentences
New Auto-Interp
Negative Logits
hibit
-0.17
xac
-0.15
ingly
-0.14
imax
-0.14
евÑĸ
-0.13
antage
-0.13
>,</
-0.13
("")]↵-0.13
hic
-0.13
ungen
-0.13
POSITIVE LOGITS
outside
0.30
Outside
0.30
Outside
0.28
aside
0.28
Aside
0.25
outside
0.25
apart
0.24
hobbies
0.24
ngoÃłi
0.23
Favorite
0.23
Activations Density 0.111%