INDEX
    Explanations

    references to geographical locations and cultural entities

    New Auto-Interp
    Negative Logits
    roke
    -0.18
    OrNull
    -0.16
    apyrus
    -0.15
    Ïģιά
    -0.15
    aginator
    -0.15
    ç¾
    -0.14
    INET
    -0.14
    üm
    -0.14
    296
    -0.14
    olen
    -0.14
    POSITIVE LOGITS
    ii
    0.59
    ji
    0.41
    iii
    0.40
    ии
    0.38
    II
    0.37
    gii
    0.37
    iji
    0.36
    ÑĸÑĹ
    0.35
     ii
    0.32
    [ii
    0.32
    Act Density 0.020%

    No Known Activations