    Explanations

    occurrences of the word "по" and its variations

    Negative Logits
    WISE     -0.16
    톡       -0.15
     dán     -0.14
    atos     -0.14
    stad     -0.14
    çon      -0.14
    avou     -0.14
    ороз     -0.14
    čan      -0.14
    čné      -0.14
    Positive Logits
    oser     0.16
     pur     0.16
    erli     0.15
     reign   0.15
     cur     0.15
    иг       0.15
    ame      0.14
    isser    0.14
     mutual  0.14
    oad      0.14
    Act Density 0.013%

    No Known Activations