INDEX
    Explanations

    The neuron detects occurrences of the word “interim,” i.e. mentions of temporary or acting appointments.

    Negative Logits
    Token             Logit
    " Nancy"          -0.07
    (blank)           -0.07
    " Savage"         -0.07
    " faces"          -0.06
    "igel"            -0.06
    " mundial"        -0.06
    "ються"           -0.06
    " \("             -0.06
    " слова"          -0.06
    " SendMessage"    -0.06
    Positive Logits
    Token             Logit
    " interim"        0.12
    "hom"             0.07
    (blank)           0.07
    "rt"              0.07
    " آی"             0.07
    " крит"           0.07
    " SCORE"          0.07
    "abinet"          0.06
    "imeInterval"     0.06
    (blank)           0.06
    Activation Density: 0.001%

    No Known Activations