INDEX
Explanations
references to Greek language or terms
Negative Logits
ώντας  -0.15
BİL  -0.15
buc  -0.15
端  -0.15
му  -0.15
ture  -0.14
ngu  -0.14
ruz  -0.14
tah  -0.14
nick  -0.14
Positive Logits
ton  0.21
tôn  0.20
Paid  0.20
autos  0.20
ē  0.20
pros  0.19
Kai  0.19
autos  0.19
kat  0.18
paid  0.18
Activations Density 0.019%