INDEX
    Explanations

    phrases that emphasize the concept of gradual progress or taking steps over time

    New Auto-Interp
    Negative Logits
    /fw
    -0.17
    EGA
    -0.15
     Artem
    -0.15
    ickey
    -0.14
    fern
    -0.14
    ê°ij
    -0.14
    cape
    -0.14
    شرÙĥØ©
    -0.14
    ÙĬÙĦÙħ
    -0.14
     gam
    -0.14
    POSITIVE LOGITS
    slow
    0.16
     Slow
    0.16
     slowly
    0.15
     extr
    0.15
    218
    0.15
     slow
    0.14
    loth
    0.14
    ợi
    0.14
     incremental
    0.14
     pie
    0.14
    Act Density 0.027%

    No Known Activations