    Negative Logits
    Token              Logit
    " cowork"          -0.10
    " coworkers"       -0.09
    "פל"               -0.08
    " beforehand"      -0.08
    " आम"              -0.08
    " immigrants"      -0.08
    "wię"              -0.08
    " revue"           -0.07
    " immigration"     -0.07
    " uyg"             -0.07
    Positive Logits
    Token              Logit
    " Hu"              0.09
    " Sounds"          0.08
    " loops"           0.08
    "gia"              0.08
    " sounds"          0.08
    "Sounds"           0.08
    " weakened"        0.08
    " свою"            0.07
    " cheerful"        0.07
    " versterken"      0.07
    Act Density 0.003%

    No Known Activations