INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ensibly
    -0.83
    usting
    -0.64
     shrunk
    -0.63
     shrinking
    -0.58
     overpowered
    -0.57
     majesty
    -0.57
     flourishing
    -0.56
     snowball
    -0.56
     rightfully
    -0.55
    omics
    -0.55
    POSITIVE LOGITS
     Lastly
    1.04
     =================================================================
    1.00
     ********************************
    1.00
    ================================================================
    0.92
     =================================
    0.91
     Finally
    0.89
     Alternatively
    0.88
     Similarly
    0.86
     Additionally
    0.85
     Else
    0.82
    Act Density 0.241%

    No Known Activations