INDEX
    Explanations

    instances of the word "из" (meaning "from" or "of" in Russian)

    Negative Logits
    iveau
    -0.16
     ade
    -0.16
    frei
    -0.16
    ROTO
    -0.16
    زة
    -0.15
    ulle
    -0.15
    ται
    -0.15
    قل
    -0.14
    stří
    -0.14
    )))),
    -0.14
    Positive Logits
    rael
    0.20
    quierda
    0.19
    abela
    0.18
    -за
    0.18
    gon
    0.18
    ogen
    0.15
    ilian
    0.15
    gie
    0.15
    g
    0.15
    source
    0.14
    Activation Density: 0.004%

    No Known Activations