INDEX
    Explanations

    expressions of speculation or uncertainty

    Negative Logits
    ugo          -0.16
    ropri        -0.15
    lique        -0.15
    ajs          -0.15
    izu          -0.15
    abis         -0.14
    lov          -0.14
    наÑĤ         -0.14
    eft          -0.14
    criptors     -0.14
    Positive Logits
    996          0.17
    659          0.15
     Bund        0.15
    pher         0.15
    662          0.15
    gens         0.14
    787          0.14
    çŁ           0.14
    iele         0.14
    ija          0.14
    Act Density 0.187%

    No Known Activations