INDEX
    Explanations

    mentions of historical events or occurrences that happened in the past

    phrases that indicate new or significant entities

    New Auto-Interp
    Negative Logits
    imar
    -0.75
    Favorite
    -0.75
    Arcade
    -0.73
    ãĥĬ
    -0.72
    inently
    -0.72
    anism
    -0.71
    views
    -0.71
    AIDS
    -0.71
    frames
    -0.70
    Avoid
    -0.70
    POSITIVE LOGITS
     colleague
    1.06
     handful
    1.05
     gunman
    0.97
     reporter
    0.97
     group
    0.96
     delegation
    0.96
     majority
    0.95
     consortium
    0.95
     spate
    0.94
     flurry
    0.93
    Act Density 0.177%

    No Known Activations