INDEX
    Explanations

    sections of a document that express logical reasoning or argumentation about decision-making processes.

    The neuron detects words and phrases that convey rushing or hasty action (e.g., “rush,” “rushing,” “rushed”).

    New Auto-Interp
    Negative Logits
     Positive
    -0.08
    élé
    -0.07
     compte
    -0.07
     Decom
    -0.07
     resilient
    -0.06
     Find
    -0.06
    ivalent
    -0.06
     Prote
    -0.06
     comun
    -0.06
     protein
    -0.06
    POSITIVE LOGITS
     rush
    0.16
     Rush
    0.15
     rushing
    0.13
     rushed
    0.12
     hurried
    0.11
     hurry
    0.11
     rushes
    0.10
     Hur
    0.08
    Hur
    0.08
    rush
    0.08
    Act Density 0.005%

    No Known Activations