INDEX
Explanations
terms related to growth rates and how they change over time
New Auto-Interp
Negative Logits
priv
-0.34
fuga
-0.31
horabuena
-0.30
立派
-0.29
forfeited
-0.29
umptive
-0.28
ợp
-0.28
skar
-0.28
Swanson
-0.28
chtigkeit
-0.28
POSITIVE LOGITS
slow
3.73
Slow
3.42
Slow
3.41
slow
3.39
slower
3.17
SLOW
3.03
slowest
2.94
SLOW
2.81
slows
2.66
slowed
2.64
Activations Density 0.801%