INDEX
Explanations
moments of pausing or stopping in action or thought
New Auto-Interp
Negative Logits
Ĩµ
-0.16
jac
-0.15
indsight
-0.15
_REGISTRY
-0.14
opoulos
-0.14
hari
-0.14
erd
-0.14
ово
-0.13
osten
-0.13
-0.13
POSITIVE LOGITS
traffic
0.19
mid
0.18
briefly
0.18
dead
0.17
traffic
0.17
halted
0.17
orr
0.17
proceedings
0.16
for
0.16
paused
0.16
Activations Density 0.059%