    Explanations

    phrases related to changing one's mind or decision-making processes based on new information or arguments

    Negative Logits
    Token             Logit
    " increa"         -1.95
    " inev"           -1.91
    " affor"          -1.90
    " guarante"       -1.89
    " volunte"        -1.88
    " disagre"        -1.87
    " depic"          -1.87
    " accla"          -1.86
    " encomp"         -1.86
    " snoopy"         -1.85
    Positive Logits
    Token             Logit
    " kasarigan"      0.83
    " stance"         0.70
    "parsedMessage"   0.66
    " regarding"      0.65
    " decision"       0.64
    " about"          0.63
    "Nullable"        0.63
    "Nonnull"         0.63
    "forChild"        0.62
    "awtextra"        0.62
    Activation Density: 0.269%

    No Known Activations