Explanations
numbers, particularly those representing years or dates
Negative Logits
kili      -0.16
edir      -0.15
itage     -0.15
omik      -0.15
rosse     -0.15
urette    -0.15
irie      -0.15
ledon     -0.14
ยว        -0.14
htable    -0.14
Positive Logits
Ras       0.16
ang       0.16
826       0.15
ased      0.15
ika       0.15
atum      0.15
uba       0.14
angkan    0.14
682       0.14
ST        0.14
Activations Density 0.011%