INDEX
    Explanations

    references to awards, nominations, and achievements

    references to award categories and their respective nominations or wins

    New Auto-Interp
    Negative Logits
    gypt
    -0.76
     probing
    -0.68
    utherford
    -0.67
    ptin
    -0.65
     coli
    -0.63
     bypass
    -0.63
     awake
    -0.62
    peror
    -0.62
     Juda
    -0.62
    manent
    -0.61
    POSITIVE LOGITS
     Best
    1.09
    seller
    1.06
     Worst
    1.01
    Best
    1.00
    sell
    0.93
    iary
    0.91
    worst
    0.91
    Winner
    0.87
    iaries
    0.83
    hest
    0.81
    Act Density 0.009%

    No Known Activations