INDEX
    Explanations

    The neuron activates on place names and geographic region mentions.

    New Auto-Interp
    Negative Logits
     ARC
    -0.07
     مشاركة
    -0.07
    inality
    -0.06
    National
    -0.06
    声音
    -0.06
    ائد
    -0.06
    .FragmentManager
    -0.06
     physician
    -0.06
     discrimination
    -0.06
    International
    -0.06
    POSITIVE LOGITS
    leniyor
    0.06
     atrib
    0.06
    不会
    0.06
     fix
    0.06
     adm
    0.06
     infiltration
    0.05
    ák
    0.05
    σκεται
    0.05
    ाएग
    0.05
     loa
    0.05
    Act Density 0.014%

    No Known Activations