INDEX
Explanations
numeric values and identifiers
New Auto-Interp
Negative Logits
aty
-0.16
anca
-0.14
asion
-0.14
ardin
-0.14
-su
-0.13
ulus
-0.13
aren
-0.13
igner
-0.13
Pret
-0.13
-strokes
-0.13
POSITIVE LOGITS
Ħ
0.15
raising
0.15
psc
0.14
phinx
0.14
795
0.14
Envelope
0.14
Voll
0.14
redients
0.14
iversal
0.13
á»ĥn
0.13
Activations Density 0.019%