INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     نور
    -0.07
     ped
    -0.07
     Breath
    -0.06
    -cycle
    -0.06
    _created
    -0.06
    иш
    -0.06
     dong
    -0.06
    vre
    -0.06
    Ingredient
    -0.06
     breath
    -0.06
    POSITIVE LOGITS
     ارزی
    0.06
     MHz
    0.06
    ترنت
    0.06
     australia
    0.06
     محاس
    0.06
    ometimes
    0.06
    0.06
     kWh
    0.06
     розвиток
    0.06
    rror
    0.06
    Act Density 0.006%

    No Known Activations