INDEX
    Explanations

    legal documents

    This neuron is recognizing tokens based on their position in the document, activating most strongly on words in the middle sections of these legal texts.

    New Auto-Interp
    Negative Logits
    care
    -0.07
     мыш
    -0.07
    .basic
    -0.06
     cigars
    -0.06
     Slovenia
    -0.06
     угод
    -0.06
    Tek
    -0.06
    .ON
    -0.06
    函数
    -0.06
     wheels
    -0.06
    POSITIVE LOGITS
     Conan
    0.06
    .preview
    0.06
    0.06
     TASK
    0.06
    [root
    0.06
     jLabel
    0.06
     payouts
    0.06
     ráno
    0.06
    HAL
    0.06
     Crushing
    0.06
    Act Density 0.004%

    No Known Activations