INDEX
    Explanations

    proper nouns and organizations

    references to specific organizations, groups, or entities

    New Auto-Interp
    Negative Logits
    ;;;;
    -0.59
    thood
    -0.59
    $.
    -0.57
    é¾įå
    -0.56
    ]+
    -0.55
    emale
    -0.54
    *.
    -0.53
    /,
    -0.53
    farious
    -0.53
    +.
    -0.52
    POSITIVE LOGITS
     has
    0.59
     will
    0.49
     announced
    0.49
     continues
    0.49
     wants
    0.48
    !--
    0.47
     continued
    0.47
     responded
    0.47
     began
    0.47
     stopped
    0.46
    Act Density 1.301%

    No Known Activations