INDEX
    Explanations

    mentions of coaching or hiring in sports contexts

    New Auto-Interp
    Negative Logits
    undan
    -0.17
    dyby
    -0.15
    маÑħ
    -0.14
    áy
    -0.14
    uced
    -0.13
    ã썿ĢĿãģĨ
    -0.13
    ç±
    -0.13
    odian
    -0.13
     Heck
    -0.12
    ox
    -0.12
    POSITIVE LOGITS
     joins
    0.27
     replaces
    0.26
     succeeds
    0.24
     previously
    0.24
     beat
    0.23
     replace
    0.22
    Replace
    0.21
     Previously
    0.20
     Replace
    0.20
    jo
    0.19
    Act Density 0.071%

    No Known Activations