INDEX
    Explanations

    phrases related to economic and political commentary, government actions, and societal issues

    New Auto-Interp
    Negative Logits
    dimension
    -0.59
    ãĥĪ
    -0.58
    ãĥĻ
    -0.57
    \":
    -0.55
    å°Ĩ
    -0.55
    ewitness
    -0.54
    ãĥŀ
    -0.52
    ãģĹ
    -0.51
    itely
    -0.50
    pires
    -0.49
    POSITIVE LOGITS
    ;
    1.12
     whereas
    1.01
     because
    1.01
    .;
    0.99
     though
    0.98
     although
    0.95
     but
    0.93
     however
    0.88
     unless
    0.83
    .
    0.82
    Act Density 1.433%

    No Known Activations