INDEX
    Explanations

    This neuron appears to be looking for a specific pattern of characters or words that don't conform to any recognizable language or structure

    specific characters or symbols, potentially from various languages or encoding systems

    New Auto-Interp
    Negative Logits
     bonded
    -0.78
    cius
    -0.76
     flour
    -0.75
     biod
    -0.73
     bour
    -0.73
     dominated
    -0.73
     rigged
    -0.73
     centrally
    -0.71
     glossy
    -0.71
     polyg
    -0.70
    POSITIVE LOGITS
    ãģŁ
    1.86
    ãģ¦
    1.82
    ãģĦ
    1.81
    ãģ¾
    1.81
    ãĤĭ
    1.80
    ãĤ
    1.78
    ãģ
    1.75
    ãģª
    1.75
    ãĢģ
    1.72
    ãģ§
    1.71
    Act Density 0.019%

    No Known Activations