INDEX
    Explanations

    instances of the word "excellent" or related terms indicating high quality or praise

    New Auto-Interp
    Negative Logits
       
    -0.16
    ến
    -0.15
    oleon
    -0.14
    emic
    -0.14
    ields
    -0.14
    oke
    -0.14
    ksam
    -0.14
    duk
    -0.14
    ationally
    -0.14
    oko
    -0.14
    POSITIVE LOGITS
    -quality
    0.22
    itude
    0.18
    iterals
    0.16
    ARRIER
    0.16
    -looking
    0.15
    lah
    0.15
    mente
    0.15
    ifar
    0.15
    ibrary
    0.14
    bih
    0.14
    Act Density 0.023%

    No Known Activations