INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    imu
    -0.07
    _Key
    -0.07
     Ebola
    -0.07
     husus
    -0.06
     slavery
    -0.06
    社會
    -0.06
     nog
    -0.06
    ahy
    -0.06
    utr
    -0.06
     فرهنگ
    -0.06
    POSITIVE LOGITS
     Modern
    0.08
     Prints
    0.08
     distress
    0.07
     stint
    0.07
     prints
    0.07
     proud
    0.06
    .features
    0.06
     strength
    0.06
     print
    0.06
     California
    0.06
    Act Density 0.003%

    No Known Activations