INDEX
Explanations
Turkish references and cultural identifiers
New Auto-Interp
Negative Logits
lar
-0.68
mı
-0.66
lı
-0.61
ların
-0.60
ları
-0.59
nak
-0.57
laş
-0.56
cı
-0.56
yla
-0.55
yı
-0.55
POSITIVE LOGITS
den
0.59
رشف
0.59
ler
0.56
AssemblyTitle
0.54
de
0.54
ől
0.54
ä
0.53
deki
0.52
lerin
0.50
onBackPressed
0.50
Activations Density 0.018%