INDEX
    Explanations

    The neuron responds to words that signal importance or necessity (e.g. “important,” “crucial”).

    New Auto-Interp
    Negative Logits
    Datetime
    -0.08
    líž
    -0.08
    alendar
    -0.07
    Sleep
    -0.07
    .get
    -0.07
    ieten
    -0.07
     ngày
    -0.07
     falling
    -0.07
    title
    -0.07
    malar
    -0.07
    POSITIVE LOGITS
     crucial
    0.13
     Cruc
    0.09
    0.07
    quan
    0.07
    @js
    0.07
     Crimes
    0.07
    ??
    0.06
     essential
    0.06
     vital
    0.06
     necess
    0.06
    Act Density 0.019%

    No Known Activations