    Negative Logits
    Token        Logit
    <bos>        -0.79
    Será         -0.64
    Nadie        -0.63
    település    -0.62
    alnız        -0.61
    Примеча      -0.61
    Alguna       -0.60
    podr         -0.58
    Selama       -0.58
    Сол          -0.58
    Positive Logits
    Token        Logit
     gmbh        1.10
     bayern      1.02
     grati       0.99
     ananas      0.96
     magis       0.96
     alkoh       0.95
     Y           0.94
     cyr         0.93
     pessi       0.93
     baum        0.92
    Activation Density: 0.061%

    No Known Activations