    Explanations

    The neuron activates on occurrences of the word “valid.”

    Negative Logits
    Between         -0.07
     compression    -0.07
     Cluster        -0.07
    house           -0.07
    _course         -0.07
     Incorrect      -0.07
    maze            -0.07
    responseObject  -0.06
     Increment      -0.06
     Floor          -0.06

    Positive Logits
     valid          0.11
     Valid          0.11
     salvage        0.08
     good           0.07
    (valid          0.07
     fair           0.07
    VALID           0.07
     stands         0.07
     solids         0.07
    .Valid          0.07
    Act Density 0.008%

    No Known Activations