INDEX
    Explanations

    terms related to news about politics and world events, specifically focusing on controversial or sensitive topics

    Negative Logits
     yield          -0.73
     pyramid        -0.69
     logger         -0.69
     rank           -0.67
     opportunities  -0.65
     fortun         -0.62
     handler        -0.61
     jog            -0.60
     convenience    -0.60
     advantage      -0.60
    Positive Logits
    ï¸ı             1.38
    ski             0.95
    ï¸              0.91
    _>              0.91
    £               0.90
    iversary        0.88
    Balt            0.88
    ews             0.87
    capital         0.86
    AFP             0.83
    Act Density 0.213%

    No Known Activations