    Explanations

    Characters and symbols from a non-Latin script, likely tied to one specific language.

    This neuron looks for text in a language written in the Cyrillic alphabet.

    Negative Logits
    Token          Logit
    "assador"      -0.78
    "essa"         -0.78
    "emonium"      -0.75
    "combe"        -0.73
    " Starr"       -0.69
    "ierrez"       -0.69
    "iqueness"     -0.69
    "worldly"      -0.67
    "aido"         -0.67
    "ernels"       -0.67
    Positive Logits
    Token     Logit
    "к"       1.61
    "ÑĤ"      1.57
    "м"       1.53
    "е"       1.53
    "Ñ"       1.51
    "Ñı"      1.50
    "д"       1.47
    "ÑĢ"      1.47
    "Ð"       1.44
    "л"       1.39
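    Several positive-logit tokens (ÑĤ, Ñı, ÑĢ, Ñ, Ð) look like raw byte-level BPE symbols rather than readable text. The page does not name the tokenizer, so the following is only a sketch under the assumption that the vocabulary uses the GPT-2 style byte-to-unicode table. The helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of any tool shown here. Under that assumption, the multi-character tokens decode to Cyrillic letters, and the single-character ones are lone UTF-8 lead bytes that begin a Cyrillic letter.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style table mapping each byte (0-255) to a printable character."""
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            # Bytes without a printable form are shifted to code points 256 and up.
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Decode a byte-level token string to text.

    Returns the token unchanged if it contains characters outside the table
    (it is then assumed to be already decoded), and None if the bytes do not
    form complete UTF-8 (for example a lone lead byte).
    """
    if any(ch not in UNICODE_TO_BYTE for ch in token):
        return token
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return None


if __name__ == "__main__":
    for tok in ["ÑĤ", "Ñı", "ÑĢ", "Ñ", "Ð", "к"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

    If the assumption holds, this reading is consistent with the Cyrillic explanation above: every top positive token is either a Cyrillic letter or the first byte of one.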
    Act Density 0.012%

    No Known Activations