INDEX
    Explanations

    place names and geographical locations

    New Auto-Interp
    Negative Logits
    jan
    -0.14
    oui
    -0.14
    etto
    -0.14
     KERNEL
    -0.14
     гÑĢи
    -0.13
    backs
    -0.13
    _PM
    -0.13
    ادا
    -0.13
    åºŃ
    -0.12
    vla
    -0.12
    POSITIVE LOGITS
    ãĥĸãĥŃ
    0.16
    üstü
    0.16
     Ple
    0.15
    üst
    0.15
    lexible
    0.14
    onders
    0.14
    Ä±ÅŁÄ±k
    0.14
     Vog
    0.14
    еÑİ
    0.14
    ürger
    0.14
    Act Density 0.067%

    No Known Activations