INDEX
    Explanations

    control commands or instructions within a technical context

    Tokens after colons or underscores

    technical, archaic, diverse scripts

    New Auto-Interp
    Negative Logits
    </em>
    -0.60
    </strong>
    -0.57
     com
    -0.48
    !
    -0.47
     fin
    -0.45
     doi
    -0.43
     zap
    -0.43
     p
    -0.42
    -0.42
     b
    -0.42
    POSITIVE LOGITS
     Efq
    1.16
     Monfieur
    0.98
    ſelf
    0.91
     myſelf
    0.90
     Jefus
    0.88
    хьтан
    0.86
     ErrIntOverflow
    0.86
     Shakspeare
    0.85
    AxisAlignment
    0.85
     itſelf
    0.83
    Act Density 0.286%

    No Known Activations