INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    20439
    -0.82
     Wink
    -0.78
    itudinal
    -0.74
     Powers
    -0.72
     Telegram
    -0.71
     Archangel
    -0.68
    aird
    -0.68
     2022
    -0.68
     Crosby
    -0.68
     Gib
    -0.67
    POSITIVE LOGITS
    stuff
    1.09
    borne
    0.95
     eaten
    0.91
     eater
    0.91
     tasted
    0.91
    cakes
    0.88
    meat
    0.87
     poisoning
    0.87
    namese
    0.86
     additives
    0.86
    Act Density 0.022%

    No Known Activations