INDEX
    Explanations

    phrases indicating a lack of responsibility or failure

    New Auto-Interp
    Negative Logits
     Raiders
    -0.69
     Bár
    -0.66
     Roanoke
    -0.64
    "><?=
    -0.63
     recevez
    -0.63
     jugé
    -0.62
     WERE
    -0.62
     ricar
    -0.62
     Eunice
    -0.62
     mists
    -0.61
    POSITIVE LOGITS
     been
    1.05
     have
    0.93
     has
    0.92
     had
    0.85
    ve
    0.78
     оригіналу
    0.77
    >({
    0.72
     Ive
    0.71
    httphttps
    0.71
     Has
    0.68
    Act Density 0.108%

    No Known Activations