INDEX
Explanations
instances of the word "stop" and related forms, indicating pauses or interruptions
New Auto-Interp
Negative Logits
дав
-0.17
ichten
-0.16
urally
-0.15
ensing
-0.15
ureau
-0.15
ervals
-0.15
ÑĢив
-0.15
tos
-0.15
cling
-0.14
stride
-0.14
POSITIVE LOGITS
page
0.26
lights
0.25
gap
0.24
pered
0.22
per
0.22
-motion
0.22
/start
0.22
over
0.20
light
0.19
PING
0.19
Activations Density 0.024%