INDEX
    Explanations

    This neuron does not consistently activate for any particular token sequence and appears not to detect any specific pattern.

    New Auto-Interp
    Negative Logits
     Atlantis
    -0.07
    Traffic
    -0.06
    _CPP
    -0.06
    22
    -0.06
    ворю
    -0.06
     canActivate
    -0.06
    doctrine
    -0.06
    clared
    -0.06
    stacles
    -0.06
    уются
    -0.06
    POSITIVE LOGITS
     innoc
    0.07
    -mask
    0.07
     фай
    0.07
     stitched
    0.06
    	restore
    0.06
     sounded
    0.06
    0.06
     chiếc
    0.06
    .....
    0.06
    جع
    0.06
    Act Density 0.025%

    No Known Activations