INDEX
    Explanations

    high values indicating quantities or scores in data outputs

    repeated mentions or references to the API and its functionality in a programming context

    archaic pronouns and titles

    New Auto-Interp
    Negative Logits
    RenderAtEndOf
    -0.57
    ">//
    -0.54
    три
    -0.49
    pola
    -0.48
     noDo
    -0.47
    三重
    -0.47
     tr
    -0.46
    3
    -0.45
    تقاوى
    -0.45
    jak
    -0.43
    POSITIVE LOGITS
     myſelf
    1.06
     itſelf
    0.99
     Monfieur
    0.94
     himſelf
    0.89
     auffi
    0.85
     ſche
    0.85
     ſtate
    0.84
     fevere
    0.84
     fubject
    0.83
     Majefty
    0.83
    Act Density 0.007%

    No Known Activations