INDEX
    Explanations

    negative aspects

    The neuron activates on language pointing out defects, disadvantages, or shortcomings—i.e. negative evaluations of prior work.

    New Auto-Interp
    Negative Logits
     itm
    -0.08
     ache
    -0.07
    lection
    -0.07
    _Login
    -0.07
     flower
    -0.06
     Hồ
    -0.06
    .login
    -0.06
     Winning
    -0.06
     methods
    -0.06
    δώ
    -0.06
    POSITIVE LOGITS
     хими
    0.07
    pyx
    0.07
     intermedi
    0.06
     OutlineInputBorder
    0.06
    ignore
    0.06
    582
    0.06
    glyphicon
    0.06
     FirebaseDatabase
    0.06
     IService
    0.06
     $#
    0.06
    Act Density 0.026%

    No Known Activations