INDEX
Explanations
Arabic prefixes
The neuron activates on Arabic words related to professions, careers, and vocational/economic terms.
New Auto-Interp
Negative Logits
“To
-0.07
LOCATION
-0.06
yr
-0.06
Tobias
-0.06
_PERSON
-0.06
pun
-0.06
Hudson
-0.06
doesn
-0.06
cock
-0.06
tt
-0.06
POSITIVE LOGITS
الب
0.08
الإن
0.07
الخ
0.07
الأم
0.07
/ch
0.07
gelen
0.07
الموقع
0.07
ите
0.07
الأسر
0.06
الأ
0.06
Activations Density 0.025%