INDEX
    Explanations

    expressions of admiration and appreciation, especially related to creativity and performance

    New Auto-Interp
    Negative Logits
    peare
    -0.17
    erto
    -0.17
     landing
    -0.16
    igner
    -0.16
    landing
    -0.16
    Ħ
    -0.15
    bbe
    -0.15
    XY
    -0.15
    ãĤ©
    -0.15
    anio
    -0.15
    POSITIVE LOGITS
     loose
    0.15
    ida
    0.15
    chia
    0.14
    ushi
    0.14
    anth
    0.14
    -f
    0.14
    utt
    0.14
    ìĬ¤íħĮ
    0.13
    anna
    0.13
     backward
    0.13
    Act Density 0.132%

    No Known Activations