INDEX
    Explanations

    phrases related to educational accomplishments and reputations

    New Auto-Interp
    Negative Logits
     advancements
    -0.71
     showcased
    -0.69
     somit
    -0.66
     אשר
    -0.66
    自身の
    -0.66
     attire
    -0.64
     showcasing
    -0.63
     captivating
    -0.62
    ]='\
    -0.62
    Upon
    -0.62
    POSITIVE LOGITS
     things
    0.90
     somebody
    0.83
     cosas
    0.82
     stuff
    0.81
     probably
    0.81
     wierd
    0.80
     basically
    0.78
     Надо
    0.78
     Somebody
    0.78
     dingen
    0.77
    Act Density 2.611%

    No Known Activations