INDEX
    Explanations

    various forms and contexts of the word "work."

    New Auto-Interp
    Negative Logits
    paravant
    -1.06
    quelize
    -0.95
    zbęd
    -0.94
     GIPHY
    -0.87
     CTP
    -0.87
     dignité
    -0.86
     Enfield
    -0.84
     desconhe
    -0.84
    Datuak
    -0.83
     unmute
    -0.82
    POSITIVE LOGITS
     work
    1.83
    Work
    1.64
     Work
    1.64
    work
    1.62
     works
    1.60
     WORK
    1.59
    WORK
    1.51
     Works
    1.42
    works
    1.39
    Works
    1.35
    Act Density 0.107%

    No Known Activations