INDEX
    Explanations

    words and phrases related to geographical locations and cultural aspects

    New Auto-Interp
    Negative Logits
    ibold
    -0.17
     Bien
    -0.16
    zioni
    -0.15
    ái
    -0.15
     AO
    -0.15
    SSF
    -0.15
    Bien
    -0.15
    ाà¤ı
    -0.15
     Clem
    -0.14
    cio
    -0.14
    POSITIVE LOGITS
    ina
    0.31
    ona
    0.30
    ica
    0.29
    ÙĪÙĦا
    0.29
    ula
    0.29
    ffa
    0.28
    ira
    0.28
    ola
    0.28
    ela
    0.28
    unda
    0.28
    Act Density 0.318%

    No Known Activations