INDEX
    Explanations

    names of places and cultural references

    Negative Logits
    LETTE     -0.16
    rana      -0.15
    ··        -0.14
    riba      -0.14
    ilha      -0.14
    cade      -0.13
    iras      -0.13
    orf       -0.13
    /qt       -0.13
    etten     -0.13

    Positive Logits
    ìļķ       0.15
    Ìģ        0.14
    776       0.14
    urance    0.14
     Toll     0.13
    IMUM      0.13
     toll     0.13
    -utils    0.13
    ews       0.13
    argout    0.13
    Act Density 1.022%

    No Known Activations