INDEX
Explanations
references to slow movement or processes
Negative Logits
Token       Logit
Kurz        -0.34
Swanson     -0.34
bico        -0.31
synthetic   -0.29
priv        -0.29
synthetic   -0.28
Würzburg    -0.28
Sexton      -0.28
fuga        -0.27
благодар    -0.27
Positive Logits
Token       Logit
slow        3.64
Slow        3.34
Slow        3.33
slow        3.30
slower      3.11
SLOW        2.94
slowest     2.91
SLOW        2.73
slows       2.73
slowed      2.73
Activations Density 0.681%