INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     schauen
    -0.08
     aquella
    -0.08
     voegen
    -0.08
    არია
    -0.08
    úla
    -0.08
    пен
    -0.08
     ღირს
    -0.08
     yetu
    -0.08
     павін
    -0.08
     செய்திகள்
    -0.08
    POSITIVE LOGITS
     từng
    0.07
     interest
    0.07
     sake
    0.07
    Interest
    0.07
     തര
    0.07
     zoom
    0.07
     OECD
    0.07
     COVID
    0.07
     illustration
    0.07
     tile
    0.07
    Act Density 0.050%

    No Known Activations