INDEX
Explanations
specific names and references related to places or historical figures
New Auto-Interp
Negative Logits
ydk
-0.17
dejting
-0.16
ÃŃk
-0.16
à¤Ĥय
-0.15
thaimassage
-0.15
ayment
-0.15
Haw
-0.15
iyet
-0.15
ikk
-0.15
ÐĶжон
-0.15
POSITIVE LOGITS
Lomb
0.25
Ferr
0.25
Tos
0.24
milan
0.24
Milan
0.24
Milano
0.23
Bernardino
0.23
Francesco
0.22
Giul
0.22
Umb
0.22
Activations Density 0.019%