INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Autumn
    -0.07
     Evening
    -0.07
     protože
    -0.06
    Pokud
    -0.06
     shuttle
    -0.06
     downwards
    -0.06
     Idea
    -0.06
    acı
    -0.06
    Portland
    -0.06
     expectancy
    -0.06
    POSITIVE LOGITS
    549
    0.07
    _HOT
    0.06
    0.06
    Redux
    0.06
     نق
    0.06
    383
    0.06
    onec
    0.06
    _duplicates
    0.06
     plural
    0.06
    _WORK
    0.06
    Act Density 0.000%

    No Known Activations