INDEX
    Explanations

    religious entities and places

    Negative Logits
    Token           Value
    "active"        0.68
    "F"             0.66
    " ۲"            0.63
    (not shown)     0.63
    "carbons"       0.62
    "food"          0.61
    (not shown)     0.60
    (not shown)     0.60
    "Music"         0.59
    (not shown)     0.59
    Positive Logits
    Token           Value
    "ع"             0.80
    "ح"             0.68
    "م"             0.66
    "ق"             0.66
    "ra"            0.64
    "{"             0.64
    "lla"           0.61
    " on"           0.60
    "li"            0.59
    " Figura"       0.59
    Activation Density: 0.000%

    No Known Activations