INDEX
    Explanations

    This neuron doesn’t respond to any tokens—it remains inactive.

    Negative Logits
    Token                     Logit
    "oler"                    -0.06
    "udeau"                   -0.06
    " says"                   -0.06
    " GridBagConstraints"     -0.06
    "INSTANCE"                -0.06
    "νει"                     -0.06
    "OLER"                    -0.06
    " Institutions"           -0.06
    (not shown)               -0.06
    " Self"                   -0.06
    Positive Logits
    Token                     Logit
    " Determine"              0.07
    " ---↵"                   0.07
    "ительства"               0.07
    "…↵"                      0.07
    (not shown)               0.06
    " bahis"                  0.06
    " exterior"               0.06
    " UNESCO"                 0.06
    "pedia"                   0.06
    " Cialis"                 0.06
    Activation Density: 0.112%

    No Known Activations