INDEX
    Explanations

    mentions of groups of people and their interactions with authority

    Negative Logits
    Token               Logit
    "UnsafeEnabled"     -0.54
    " препратки"        -0.53
    "ArrowToggle"       -0.51
    " مشين"             -0.51
    "Enllaces"          -0.50
    "Filmographie"      -0.49
    "parsedMessage"     -0.48
    " propi"            -0.47
    " następu"          -0.46
    "invokeLater"       -0.46
    Positive Logits
    Token               Logit
    " pund"             0.64
    "ftagPool"          0.63
    "schild"            0.63
    (no visible token)  0.59
    "onAttach"          0.59
    "HttpPut"           0.59
    "ertale"            0.57
    " Curl"             0.56
    "initState"         0.55
    " mainstream"       0.54
    Activation Density: 0.282%

    No Known Activations