INDEX
    Explanations

    this neuron seems to activate on numbers

    New Auto-Interp
    Negative Logits
     esternos
    -0.67
    rrggbb
    -0.67
    istoitu
    -0.67
     виправивши
    -0.66
    Архівовано
    -0.65
     ویکی‌پدیای
    -0.65
     دیکھیے
    -0.62
     transfieras
    -0.61
     Мексичка
    -0.60
    Ծանոթ
    -0.60
    POSITIVE LOGITS
    MathML
    0.66
    はじめに
    0.52
    FFF
    0.52
    BufferException
    0.49
    trä
    0.45
    GW
    0.44
    וח
    0.44
    B
    0.43
    Y
    0.42
    MW
    0.42
    Act Density 2.324%

    No Known Activations