INDEX
    Explanations

    references to the city of Beijing

    Negative Logits
    Token           Logit
    "oway"          -0.73
    "adoes"         -0.73
    "osate"         -0.72
    "RH"            -0.72
    " Trooper"      -0.69
    "pent"          -0.68
    "••••"          -0.68
    "ocene"         -0.67
    "RANT"          -0.67
    " Hitchcock"    -0.65
    Positive Logits
    Token           Logit
    "ijing"         1.17
    " Jinping"      1.04
    " Beijing"      1.00
    " Lumpur"       0.99
    " Yuan"         0.89
    "jing"          0.88
    "zhou"          0.86
    "jin"           0.84
    " Jing"         0.81
    "wei"           0.81
    Act Density 0.012%

    No Known Activations