INDEX
    Explanations

    expressions indicating permanence or frequency

    New Auto-Interp
    Negative Logits
     alberto
    -0.68
     bangkok
    -0.67
     Quod
    -0.66
     kani
    -0.65
     ikat
    -0.64
     andrea
    -0.64
     Cfr
    -0.63
     Epif
    -0.63
     guma
    -0.63
     baka
    -0.63
    POSITIVE LOGITS
     ALWAYS
    1.09
     always
    1.09
    Always
    1.04
    always
    1.03
     Always
    1.02
    ALWAYS
    1.01
    deauna
    0.90
    <bos>
    0.86
     alway
    0.83
     siempre
    0.83
    Act Density 0.185%

    No Known Activations