INDEX
    Explanations

    references to Russia and its political figures

    New Auto-Interp
    Negative Logits
     NDEBUG
    -0.66
    =$?
    -0.63
     ")");
    -0.63
     Diſ
    -0.62
     المبار
    -0.61
     itſelf
    -0.60
    อ้างอิง
    -0.59
    šanai
    -0.59
     fufficient
    -0.59
     ovation
    -0.58
    POSITIVE LOGITS
     hem
    1.03
    hem
    0.84
     Russian
    0.80
     Russia
    0.77
    Russian
    0.77
    XmlEnum
    0.77
    новниш
    0.75
     Hem
    0.72
     Justice
    0.71
    Hem
    0.70
    Act Density 0.119%

    No Known Activations