INDEX
    Explanations

    This neuron detects mentions of user authentication actions, especially “log in,” “sign in,” or similar login phrases.

    New Auto-Interp
    Negative Logits
    <Project
    -0.06
    .folder
    -0.06
     kans
    -0.06
    нерг
    -0.06
    _thresh
    -0.06
     bundle
    -0.06
    (collection
    -0.06
    _curve
    -0.06
    Nhap
    -0.06
    pan
    -0.06
    POSITIVE LOGITS
     submit
    0.07
     Loading
    0.06
    claim
    0.06
     Started
    0.06
    Eth
    0.06
     세상
    0.06
    ें
    0.06
    дают
    0.06
     Running
    0.06
    -known
    0.06
    Act Density 0.013%

    No Known Activations