INDEX
    Explanations

    phrases that emphasize the concept of news reporting or significant events

    New Auto-Interp
    Negative Logits
     awa
    -0.77
    KK
    -0.72
    respective
    -0.71
    wagon
    -0.69
    PB
    -0.67
     supervised
    -0.67
    onne
    -0.67
     Canaver
    -0.67
    netflix
    -0.65
    aceous
    -0.64
    POSITIVE LOGITS
     Errors
    0.89
     Hours
    0.74
     Herald
    0.74
     Seasons
    0.72
     Liberties
    0.67
    Times
    0.66
    requency
    0.65
     Inquiry
    0.64
     Planet
    0.63
     Ages
    0.63
    Act Density 0.020%

    No Known Activations