INDEX
    Explanations

    the word "good" and related words indicating positivity or high quality

    phrases or contexts indicating positive quality or reassurance

    New Auto-Interp
    Negative Logits
    hyde
    -0.79
    eters
    -0.75
    oths
    -0.73
    atum
    -0.73
    pper
    -0.70
    eds
    -0.69
    ategory
    -0.67
     Tsukuyomi
    -0.67
    opers
    -0.66
     Reincarnated
    -0.65
    POSITIVE LOGITS
    enough
    1.35
     luck
    1.12
    bye
    1.05
    reads
    1.04
     ol
    1.03
    luck
    1.03
     enough
    1.03
     Samar
    1.02
     intentions
    1.00
     quality
    0.89
    Act Density 0.081%

    No Known Activations