INDEX
    Explanations

    information related to software updates or changes

    New Auto-Interp
    Negative Logits
    lies
    -0.71
     Relief
    -0.69
    Must
    -0.68
     Awakens
    -0.67
     Borders
    -0.65
     Needs
    -0.64
    terday
    -0.63
    âĦ¢:
    -0.63
    ives
    -0.61
    rones
    -0.61
    POSITIVE LOGITS
     likened
    1.05
     replaced
    0.97
     deemed
    0.95
     subjected
    0.93
     criticized
    0.93
     taken
    0.93
    avering
    0.93
     upgraded
    0.91
     hijacked
    0.90
     dubbed
    0.89
    Act Density 0.196%

    No Known Activations