INDEX
Explanations
topics related to change and transition over time
New Auto-Interp
Negative Logits
ëĮĢë¡ľ
-0.14
زÙĨ
-0.14
REE
-0.13
æ¬ł
-0.13
åºŃ
-0.13
ÑģÑĮко
-0.13
Sadd
-0.13
ong
-0.13
ómo
-0.13
æĺĩ
-0.13
POSITIVE LOGITS
subs
0.57
subs
0.43
dissip
0.34
ab
0.33
peter
0.29
rec
0.28
_subs
0.27
diss
0.26
settle
0.26
tapered
0.26
Activations Density 0.225%