INDEX
    Explanations

    mathematical formulas and symbols.

    mathematical formulas

    Negative Logits
    " D"        -0.50
    " Rom"      -0.50
    "<bos>"     -0.50
    " sig"      -0.48
    " Z"        -0.44
    " Y"        -0.43
    " Push"     -0.43
    "romas"     -0.43
    " blo"      -0.43
    " drawing"  -0.42
    Positive Logits
    "Personendaten"  0.90
    " Anſ"           0.78
    " Majefty"       0.77
    " ſtate"         0.76
    " Diſ"           0.76
    " poffe"         0.75
    " fubject"       0.74
    " myſelf"        0.73
    " ſeveral"       0.73
    " defire"        0.72
    Activation Density: 0.344%

    No Known Activations