INDEX
    Explanations

    references to specific social or political groups and events

    New Auto-Interp
    Negative Logits
    া
    -0.16
     }
    -0.16
     }↵
    -0.16
     ¶
    -0.14
     */↵
    -0.14
     ):↵
    -0.13
     Virt
    -0.13
    Įĵ
    -0.13
    rego
    -0.13
     »,
    -0.13
    POSITIVE LOGITS
    еÐ
    0.29
    ÑĢаÐ
    0.29
    оÐ
    0.23
    аÐ
    0.22
    ToolStripMenuItem
    0.14
    ________________________________________________________________
    0.14
    altet
    0.13
    -outs
    0.13
     outs
    0.13
    aN
    0.13
    Act Density 0.545%

    No Known Activations