    Negative Logits
    Token             Logit
    "Cordialement"    -0.73
    " noble"          -0.72
    "noble"           -0.67
    "pozdrawiam"      -0.66
    "substantial"     -0.65
    " noblest"        -0.63
    " atve"           -0.61
    "stability"       -0.59
    "Noble"           -0.58
    " tramonto"       -0.57
    Positive Logits
    Token             Logit
    "ly"              1.23
    "ity"             0.85
    "ized"            0.82
    "ist"             0.77
    "men"             0.75
    "ism"             0.74
    "ally"            0.73
    "id"              0.69
    "ities"           0.69
    "LY"              0.69
    Activation Density: 0.046%

    No Known Activations