INDEX
Explanations
references to Greek terminology and their meanings
New Auto-Interp
Negative Logits
omon
-0.15
esi
-0.15
ider
-0.14
iesz
-0.14
ello
-0.14
alette
-0.14
eso
-0.13
ucht
-0.13
illery
-0.13
ernels
-0.13
POSITIVE LOGITS
uar
0.16
istrovstvÃŃ
0.16
.tif
0.15
kt
0.15
ÙĪØ³ÛĮ
0.14
пе
0.14
álido
0.14
oyer
0.13
é¢ij次
0.13
èĢ
0.13
Activations Density 0.028%