INDEX
    Explanations

    references to geographical locations, particularly cities and countries

    New Auto-Interp
    Negative Logits
    orida
    -0.16
    utex
    -0.16
     sag
    -0.15
    iç
    -0.14
    µ¬
    -0.14
    osit
    -0.14
    inge
    -0.14
    ÅĻÃŃd
    -0.14
    ono
    -0.14
    ague
    -0.14
    POSITIVE LOGITS
     Sinai
    0.15
    rewrite
    0.15
    izzato
    0.14
    inan
    0.14
    kad
    0.14
    igg
    0.14
    ëģ
    0.14
     Windsor
    0.14
    eyer
    0.14
    itan
    0.13
    Act Density 0.016%

    No Known Activations