INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Gors
    -0.80
     cler
    -0.74
     tabl
    -0.74
     Sylv
    -0.73
     foremost
    -0.73
     Vie
    -0.73
     oun
    -0.73
     slips
    -0.72
     Ambrose
    -0.70
     destro
    -0.70
    POSITIVE LOGITS
    define
    1.37
    DIV
    1.26
    !/
    1.23
    Gamer
    1.22
    ################################
    1.21
    include
    1.15
    ########
    1.10
    DonaldTrump
    1.09
    RIP
    1.09
    Ask
    1.08
    Act Density 0.281%

    No Known Activations