INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     winds
    -0.08
     premiums
    -0.08
    رید
    -0.08
     morning
    -0.08
    ptom
    -0.07
     wages
    -0.07
    .he
    -0.07
    imized
    -0.07
    니다
    -0.07
     Winds
    -0.07
    POSITIVE LOGITS
    Arm
    0.09
     ign
    0.09
     svil
    0.08
    _extent
    0.08
    Ign
    0.07
    Fuel
    0.07
    0.07
    0.07
     સામ
    0.07
     hof
    0.07
    Act Density 0.005%

    No Known Activations