INDEX
    Explanations

    phrases related to politics and news

    New Auto-Interp
    Negative Logits
    been
    -0.76
     anymore
    -0.68
    heed
    -0.68
    tan
    -0.63
    arta
    -0.63
    venge
    -0.62
    yond
    -0.62
    volent
    -0.58
     Yourself
    -0.58
    magic
    -0.56
    POSITIVE LOGITS
     abruptly
    0.72
    nesday
    0.70
     briefly
    0.69
     tremend
    0.68
     initially
    0.67
     originally
    0.67
     unanimously
    0.66
     last
    0.65
     yesterday
    0.64
     unsuccessfully
    0.63
    Act Density 15.271%

    No Known Activations