INDEX
    Explanations

    instances of the word "work" and its variations

    New Auto-Interp
    Negative Logits
    paravant
    -1.01
    quelize
    -0.96
    zbęd
    -0.94
     CTP
    -0.91
     Enfield
    -0.88
     GIPHY
    -0.86
     Nadel
    -0.84
     desconhe
    -0.81
     purpoſe
    -0.81
     unmute
    -0.81
    POSITIVE LOGITS
     work
    1.65
     Work
    1.52
    Work
    1.52
     WORK
    1.49
     works
    1.49
    work
    1.44
    WORK
    1.39
     Works
    1.34
    Works
    1.28
    works
    1.28
    Act Density 0.091%

    No Known Activations