INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    grafa
    -0.45
    })+
    -0.45
    Counter
    -0.43
     linh
    -0.41
     annual
    -0.41
    getAction
    -0.41
    -0.41
     pro
    -0.41
    nfl
    -0.40
     count
    -0.40
    POSITIVE LOGITS
    Vidite
    0.78
     متعلقه
    0.74
     betweenstory
    0.73
     nahilalakip
    0.73
    InjectAttribute
    0.73
    PreferredItem
    0.68
     يتيمه
    0.66
    OGND
    0.65
    0.65
     Италијани
    0.64
    Act Density 0.011%

    No Known Activations