INDEX
    Explanations

    occurrences of single quotes

    New Auto-Interp
    Negative Logits
    illac
    -0.14
    bÃŃr
    -0.14
    autiful
    -0.14
    .toolbox
    -0.14
    ÑĢава
    -0.13
    ctl
    -0.13
    lassian
    -0.13
    y
    -0.13
    νÏī
    -0.13
     Jako
    -0.13
    POSITIVE LOGITS
    cee
    0.14
    roys
    0.14
    ITS
    0.14
    ries
    0.13
    uzzi
    0.13
    ikes
    0.13
    ustin
    0.13
    /goto
    0.13
     Schultz
    0.13
     DVD
    0.13
    Act Density 0.015%

    No Known Activations