INDEX
    Explanations

    references to serious incidents and their consequences

    New Auto-Interp
    Negative Logits
    tsy
    -0.17
    .scalablytyped
    -0.17
    ighton
    -0.15
     pozor
    -0.14
     violence
    -0.14
    елик
    -0.14
    /***/
    -0.14
    urgeon
    -0.13
    VED
    -0.13
     crime
    -0.13
    POSITIVE LOGITS
     Wich
    0.18
    ÑĢок
    0.18
    pong
    0.16
     Pvt
    0.15
    trad
    0.15
    agers
    0.15
     Strauss
    0.14
     Harding
    0.14
    æ¶ī
    0.14
     Tradition
    0.14
    Act Density 0.357%

    No Known Activations