INDEX
    Explanations

    prepositions

    Negative Logits
    Token                 Logit
    " Ste"                -0.08
    "mn"                  -0.08
    (token not shown)     -0.08
    " steep"              -0.08
    " riv"                -0.08
    " ғылыми"             -0.07
    "それ"                -0.07
    ",\""                 -0.07
    " 国内"               -0.07
    (token not shown)     -0.07
    Positive Logits
    Token                 Logit
    " einschließlich"     0.08
    "/of"                 0.08
    "Kev"                 0.08
    "November"            0.07
    " evidenced"          0.07
    (token not shown)     0.07
    "-wide"               0.07
    ".discovery"          0.07
    "News"                0.07
    (token not shown)     0.07
    Activation Density: 0.351%

    No Known Activations