INDEX
    Explanations

    tokens in mathematical or scientific discourse, like "measure", "using", numbers, and symbols like "g"

    scientific terminology

    New Auto-Interp
    Negative Logits
     myſelf
    -1.11
     himſelf
    -1.09
     themſelves
    -1.01
     fubject
    -1.00
     pleaſure
    -0.98
     itſelf
    -0.98
     Monfieur
    -0.96
     purpoſe
    -0.96
     deſt
    -0.94
     Majefty
    -0.93
    POSITIVE LOGITS
     Gra
    0.53
     work
    0.52
    }",
    0.50
    0.48
     ra
    0.48
     de
    0.48
     Sav
    0.48
     Sha
    0.47
     the
    0.47
     re
    0.47
    Act Density 2.453%

    No Known Activations