INDEX
    Explanations

    references to high quality and excellence in various contexts

    New Auto-Interp
    Negative Logits
       
    -0.15
    λÏī
    -0.15
    ationally
    -0.15
    emic
    -0.15
    oke
    -0.15
    oko
    -0.15
    ết
    -0.14
    155
    -0.14
    oot
    -0.14
    ocker
    -0.14
    POSITIVE LOGITS
    -quality
    0.23
    bum
    0.17
    lah
    0.16
    ively
    0.16
    aire
    0.16
    人æīį
    0.15
    aires
    0.15
    ncpy
    0.15
    mind
    0.15
    ¢°
    0.15
    Act Density 0.033%

    No Known Activations