INDEX
    Explanations

    the word "just" and its variations, indicating a focus on simplicity or minimalism in expression

    New Auto-Interp
    Negative Logits
    uzzi
    -0.17
    á»iji
    -0.16
    åª
    -0.15
    à¥įवव
    -0.15
     Allah
    -0.14
    iosk
    -0.14
    عÙĦÙĤ
    -0.14
     Decomp
    -0.13
    æĥ
    -0.13
    çĮ
    -0.13
    POSITIVE LOGITS
    rys
    0.16
    à¹Ĩ
    0.14
    .rev
    0.14
    >'.↵
    0.14
    EO
    0.14
    vis
    0.14
    tics
    0.13
    haf
    0.13
     HttpServlet
    0.13
    enger
    0.13
    Act Density 0.031%

    No Known Activations