INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Amen
    -0.77
     interven
    -0.70
    aukee
    -0.64
     wills
    -0.64
     sexes
    -0.61
    BOOK
    -0.61
    ieth
    -0.60
    ifice
    -0.60
     regenerate
    -0.60
     Constitution
    -0.59
    POSITIVE LOGITS
     understandably
    0.92
     rightfully
    0.91
     unsurprisingly
    0.86
     predictably
    0.84
     rightly
    0.79
    Yesterday
    0.75
     Turns
    0.75
    Apparently
    0.74
    Unfortunately
    0.74
     Unfortunately
    0.73
    Act Density 0.712%

    No Known Activations