INDEX
    Explanations

    words related to specific geographical or geopolitical topics

    New Auto-Interp
    Negative Logits
     Bene
    -0.16
    ษ
    -0.15
    olia
    -0.14
    erb
    -0.14
    am
    -0.14
     Ken
    -0.14
     Deus
    -0.14
     Norm
    -0.14
     Heights
    -0.14
    amo
    -0.14
    POSITIVE LOGITS
    owy
    0.24
    ový
    0.20
    owych
    0.19
    оваÑı
    0.16
    ye
    0.16
    nic
    0.15
    овÑĭе
    0.15
    owego
    0.15
    ny
    0.15
    наÑı
    0.15
    Act Density 0.094%

    No Known Activations