INDEX
    Explanations

    news-related phrases

    phrases and words that introduce or emphasize parts of an argument or statement

    New Auto-Interp
    Negative Logits
     helicop
    -0.59
     Vaugh
    -0.59
     destro
    -0.55
     challeng
    -0.53
     blah
    -0.51
    farious
    -0.51
     advoc
    -0.51
    enegger
    -0.51
     behav
    -0.50
     â̦"
    -0.50
    POSITIVE LOGITS
    resa
    0.88
    xiety
    0.76
     largeDownload
    0.65
    BSD
    0.63
    odore
    0.60
    Berry
    0.59
    asionally
    0.57
    tymology
    0.56
    ories
    0.56
    ropolitan
    0.53
    Act Density 0.438%

    No Known Activations