INDEX
    Explanations

    references to places or concepts of belonging and residency

    New Auto-Interp
    Negative Logits
    pers
    -0.16
    alth
    -0.16
    umont
    -0.15
     Reese
    -0.14
    =-=-=-=-
    -0.14
    βα
    -0.14
     Ivan
    -0.14
    _shadow
    -0.14
    essen
    -0.13
    illard
    -0.13
    POSITIVE LOGITS
    oky
    0.16
    eras
    0.15
    iky
    0.14
    PRI
    0.13
     ex
    0.13
    MBER
    0.13
    ãĥ¼ãĥĸãĥ«
    0.13
     اÙĦÙħص
    0.13
    ienda
    0.13
    aca
    0.13
    Act Density 0.080%

    No Known Activations