INDEX
Explanations
proper nouns, particularly associated with religious figures and locations
New Auto-Interp
Negative Logits
izmet
-0.15
senal
-0.15
ãĤĥ
-0.14
átis
-0.14
ince
-0.14
kke
-0.14
pline
-0.14
WaitForSeconds
-0.13
رÙħ
-0.13
-ST
-0.13
POSITIVE LOGITS
Mir
0.16
intens
0.15
Sau
0.14
ẩn
0.14
already
0.14
numbered
0.13
orden
0.13
'
0.13
Gu
0.13
bite
0.13
Activations Density 0.312%