INDEX
    Explanations

    neurons related to programming and technical language

    Mathematical or unusual symbols

    technical terms and punctuation

    New Auto-Interp
    Negative Logits
    </em>
    -0.49
     —
    -0.49
    يا
    -0.48
     =
    -0.47
    だけに
    -0.47
     morti
    -0.46
     #
    -0.46
    JI
    -0.46
     @
    -0.45
     .
    -0.45
    POSITIVE LOGITS
    TagMode
    0.92
    abestanden
    0.91
    Hochspringen
    0.88
    存于互联网档案馆
    0.87
     للمعارف
    0.82
    ViewImports
    0.82
    AutoresizingMask
    0.80
     ویکی‌پدیای
    0.79
     سكانية
    0.78
     Roskov
    0.78
    Act Density 0.024%

    No Known Activations