INDEX
    Explanations

    occurrences of legal and procedural language

    New Auto-Interp
    Negative Logits
    adio
    -0.16
    urus
    -0.15
    ży
    -0.14
    .getSelection
    -0.14
    Cb
    -0.14
    hol
    -0.14
    _CB
    -0.14
    ãĤ¤ãĤ¯
    -0.13
     DeÄŁ
    -0.13
    ocl
    -0.13
    POSITIVE LOGITS
    amburg
    0.15
    idable
    0.15
    erto
    0.15
    vang
    0.15
    .Blocks
    0.14
    shima
    0.14
    cient
    0.14
    agh
    0.14
    imetype
    0.14
    NewItem
    0.13
    Act Density 0.001%

    No Known Activations