INDEX
Explanations
sentences that contain punctuation, specifically periods
Negative Logits
aar       -0.16
x         -0.14
ience     -0.14
compass   -0.14
emin      -0.14
aat       -0.13
Cain      -0.13
oms       -0.13
будто     -0.13
pred      -0.12
Positive Logits
rient     0.15
급        0.15
ıs        0.15
iversit   0.15
raq       0.15
odyn      0.14
shm       0.14
zeń       0.13
.her      0.13
agues     0.13
Activations Density 0.017%