INDEX
    Explanations

    Specific linguistic elements from different languages or scripts, particularly terms or phrases that may be culturally or regionally significant.

    Negative Logits
    Token       Logit
    ð           -0.15
    å®ĺ         -0.15
    ìľµ         -0.14
    егоÑĢ       -0.14
    eyh         -0.14
    old         -0.14
    оÑģнов      -0.14
    lint        -0.13
    oft         -0.13
    Ù           -0.13
    Positive Logits
    Token       Logit
     pract      0.14
    jeta        0.14
    /topics     0.14
     OCR        0.14
    uler        0.13
    ubber       0.13
    Programming 0.13
    linger      0.13
    sembling    0.13
    ivery       0.13
    Activation Density 0.025%

    No Known Activations