INDEX
    Explanations

    references to historical events, particularly related to Tiananmen Square

    references to the Tiananmen Square events

    New Auto-Interp
    Negative Logits
     Wrap
    -0.66
     McH
    -0.66
    imal
    -0.64
    wagen
    -0.63
    mobi
    -0.63
    ãģĨ
    -0.63
    anthrop
    -0.62
    lihood
    -0.61
    ï¸
    -0.61
     jackets
    -0.60
    POSITIVE LOGITS
     Tian
    0.93
    eworld
    0.81
    jin
    0.80
    ept
    0.76
    ellation
    0.76
     glim
    0.76
    encia
    0.72
    zza
    0.72
    borgh
    0.70
    enza
    0.69
    Act Density 0.012%

    No Known Activations