    Explanations

    phrases related to education and academic achievement

    Negative Logits
    Token       Logit
    "bir"       -0.16
    "born"      -0.16
    " aliqu"    -0.15
    "ethical"   -0.15
    "usz"       -0.14
    "blem"      -0.14
    "isu"       -0.14
    "iro"       -0.14
    "bons"      -0.14
    "olik"      -0.14
    Positive Logits
    Token       Logit
    "λε"        0.16
    "phia"      0.16
    "ãĥ¼ãĥĹ"    0.16
    "toFloat"   0.15
    "eração"    0.15
    ".toFloat"  0.15
    "andle"     0.14
    "ãn"        0.14
    " level"    0.14
    "esser"     0.14
    Activation Density: 0.078%

    No Known Activations