INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    antar
    -0.15
    èĪį
    -0.15
    lug
    -0.15
    SCI
    -0.14
    št
    -0.14
    achsen
    -0.14
    town
    -0.14
    oftware
    -0.14
    odesk
    -0.14
    åIJĽ
    -0.14
    POSITIVE LOGITS
    yer
    0.16
     shar
    0.14
    gent
    0.14
    gebn
    0.14
     shaded
    0.14
    .Apply
    0.13
     caval
    0.13
    SEMB
    0.13
     Geneva
    0.13
     swe
    0.13
    Act Density 0.026%

    No Known Activations