INDEX
    Explanations

    instances of the word "os".

    Negative Logits
     nakalista      -0.66
                    -0.60
     autorytatywna  -0.59
     distanciation  -0.58
     Dage           -0.58
     bezeichneter   -0.58
     препратки      -0.57
    pkey            -0.56
                    -0.56
     agré           -0.56
    Positive Logits
    os              4.20
    OS              3.03
     os             2.54
    Os              2.35
     Os             2.23
     OS             2.10
    osk             1.54
    osz             1.48
    osy             1.34
    osc             1.29
    Act Density 0.065%

    No Known Activations