    NEGATIVE LOGITS
    Token        Logit
    "જી"         -0.09
    " Dish"      -0.09
    "_recipe"    -0.08
    " Physi"     -0.08
    "↵↵↵↵↵↵"     -0.08
    "Dish"       -0.07
    "صفات"       -0.07
    "(recipe"    -0.07
    " overs"     -0.07
    " Fringe"    -0.07
    POSITIVE LOGITS
    Token           Logit
    " قطع"          0.08
    "eraar"         0.08
    " taut"         0.08
                    0.07
    " concluding"   0.07
    "[href"         0.07
    "ibling"        0.07
    " enclosed"     0.07
    " исп"          0.07
    " conclude"     0.07
    Activation Density: 0.013%

    No Known Activations