INDEX
    Explanations

    proper nouns, specifically names of individuals or entities

    Negative Logits
    "ゼウス"       -0.91
    "ヴァ"         -0.73
    "ウス"         -0.66
    " partName"    -0.64
    "issance"      -0.63
    "shown"        -0.63
    "ा"            -0.62
    "metic"        -0.62
    " Janeiro"     -0.61
    "INA"          -0.60
    Positive Logits
    "zinski"       0.70
    "acket"        0.61
    "pson"         0.61
    " Samurai"     0.59
    "bley"         0.59
    "iott"         0.59
    "otos"         0.58
    "bye"          0.58
    "kes"          0.58
    "oller"        0.57
    Activation Density: 0.032%

    No Known Activations