INDEX
    Explanations

    The main thing this neuron does is detect the verb “climb” (in all its forms).

    New Auto-Interp
    Negative Logits
     dispersion
    -0.08
    128
    -0.07
     nozzle
    -0.07
     net
    -0.07
    -0.06
     rust
    -0.06
     husus
    -0.06
     mode
    -0.06
     technological
    -0.06
     Peng
    -0.06
    POSITIVE LOGITS
     climb
    0.13
     climbing
    0.12
     climbed
    0.12
     climbs
    0.10
     Clim
    0.08
    camp
    0.07
    Interpreter
    0.07
     steep
    0.07
    meg
    0.07
    0.07
    Act Density 0.006%

    No Known Activations