    Explanations

    phrases related to people and their actions within news articles

    Negative Logits
    Token            Logit
    " Guys"          -0.65
    " VIDEOS"        -0.64
    " Bound"         -0.61
    " Anything"      -0.60
    "Georg"          -0.59
    "IND"            -0.58
    " Meaning"       -0.58
    " economical"    -0.56
    "UT"             -0.56
    "SE"             -0.56
    Positive Logits
    Token            Logit
    " oversaw"       1.18
    " specializes"   1.15
    " owns"          1.11
    " oversees"      1.08
    " resided"       1.08
    " preceded"      1.01
    " attends"       1.01
    " specialize"    1.00
    " resides"       1.00
    " presided"      0.99
    Activation Density: 0.584%

    No Known Activations