    Explanations

    The neuron fires primarily on instances of the quantifier “all.”

    Negative Logits
    .HasValue          -0.07
     grandchildren     -0.06
     purpos            -0.06
     ذات               -0.06
    .Direction         -0.06
    (LogLevel          -0.06
    (blank)            -0.06
    “我                -0.06
     Looking           -0.06
    ArgsConstructor    -0.06
    Positive Logits
     all      0.07
    이나      0.07
    \R        0.07
     كل       0.07
    ][-       0.07
    (URL      0.06
    _fail     0.06
     arrive   0.06
     Aws      0.06
     <?=      0.06
    Activation Density: 0.036%

    No Known Activations