INDEX
    Explanations

    "it" and "is"

    New Auto-Interp
    Negative Logits
    npos
    -0.07
     tep
    -0.06
    сты
    -0.06
     suspect
    -0.06
    [counter
    -0.06
     куда
    -0.06
    omaly
    -0.06
    odka
    -0.06
     wrestling
    -0.06
     především
    -0.06
    POSITIVE LOGITS
    0.07
    0.07
    ил
    0.07
    air
    0.07
    available
    0.06
    생활
    0.06
    aint
    0.06
    ayne
    0.06
     keywords
    0.06
    ifa
    0.06
    Act Density 0.058%

    No Known Activations