INDEX
    Explanations

    The neuron flags marketing superlative phrases about upholding exceptionally high standards of quality or safety.

    New Auto-Interp
    Negative Logits
     erroneous
    -0.08
     interaction
    -0.07
     Eff
    -0.06
     İst
    -0.06
    _bag
    -0.06
     cosy
    -0.06
     Jessica
    -0.06
     frequent
    -0.06
    .jet
    -0.06
     Interaction
    -0.06
    POSITIVE LOGITS
     standards
    0.13
     Standards
    0.11
    _STD
    0.07
     Серед
    0.07
    Targets
    0.07
     předpis
    0.07
    0.07
    0.07
     onlar
    0.07
    В
    0.07
    Act Density 0.015%

    No Known Activations