INDEX
    Explanations

    phrases related to prohibitions or restrictions

    phrases indicating negation or absence

    New Auto-Interp
    Negative Logits
     Reloaded
    -0.81
     Reborn
    -0.72
     "$:/
    -0.70
     Royale
    -0.69
     vind
    -0.68
     restored
    -0.68
     chapter
    -0.66
     Fork
    -0.66
    gypt
    -0.63
     Unch
    -0.63
    POSITIVE LOGITS
    brainer
    1.31
    strings
    1.16
    repeat
    1.13
    exc
    1.11
    reply
    1.09
    smoking
    1.08
    contact
    1.08
    platform
    1.05
    matter
    1.03
    notice
    1.02
    Act Density 0.016%

    No Known Activations