INDEX
Explanations
references to classic literature and notable authors
Negative Logits
Token       Logit
Gaul        -0.16
okable      -0.15
occo        -0.15
Scholars    -0.14
apot        -0.14
аÑĢам       -0.14
adera       -0.14
presso      -0.14
eken        -0.14
reau        -0.13
Positive Logits
Token       Logit
contempor   0.18
hala        0.15
æŀļ         0.14
edeki       0.14
Ðİ          0.14
Sydney      0.14
Soph        0.14
Wells       0.13
drill       0.13
åº          0.13
Activations Density 0.172%