INDEX
    Explanations

    emphasis or attention-grabbing symbols and text patterns

    special characters or symbols in the text

    New Auto-Interp
    Negative Logits
    cephal
    -0.68
    uces
    -0.66
     spir
    -0.66
     rack
    -0.65
    kered
    -0.65
    azing
    -0.62
     srf
    -0.62
    ved
    -0.62
     scattering
    -0.61
     manifold
    -0.61
    POSITIVE LOGITS
    WARNING
    0.98
    THIS
    0.97
    !/
    0.93
     WARNING
    0.86
    UPDATE
    0.86
    Discussion
    0.82
     ***
    0.82
    NOT
    0.80
    EDIT
    0.80
    DOWN
    0.79
    Act Density 0.046%

    No Known Activations