INDEX
    Explanations

    phrases related to interviews or press conferences

    references to news programming or discussions surrounding political events

    New Auto-Interp
    Negative Logits
     sake
    -0.74
    opsis
    -0.66
    ĨĴ
    -0.64
    emp
    -0.62
    hood
    -0.61
     arithmetic
    -0.60
     mol
    -0.60
    ength
    -0.59
     Mahjong
    -0.58
    steen
    -0.58
    POSITIVE LOGITS
    adr
    0.81
    orage
    0.75
    lyak
    0.74
    rolet
    0.70
    ritic
    0.70
    riot
    0.68
     Dull
    0.68
    OTOS
    0.67
    autions
    0.67
    iets
    0.66
    Act Density 0.117%

    No Known Activations