INDEX
    Explanations
    NEGATIVE LOGITS
    Token           Logit
    " Croat"        -0.07
    " міс"          -0.07
    "_gray"         -0.07
    " ahead"        -0.07
    " layoffs"      -0.07
    "िजल"           -0.06
    "ir"            -0.06
    " hypoth"       -0.06
    "_eval"         -0.06
    " setEmail"     -0.06
    POSITIVE LOGITS
    Token           Logit
    " Scalar"       0.06
    "abb"           0.06
    " STATE"        0.06
    " B"            0.06
    " year"         0.05
    "arsing"        0.05
    "627"           0.05
    "South"         0.05
    "Floating"      0.05
    "Major"         0.05
    Activation Density: 0.008%

    No Known Activations