INDEX
    Explanations

    terms related to scientific measurement and quantification

    New Auto-Interp
    Negative Logits
     myſelf
    -1.15
     themſelves
    -1.12
    UnusedPrivate
    -1.11
     pleaſure
    -1.10
    RenderAtEndOf
    -1.06
    ſelves
    -1.05
    ſelf
    -1.05
     ſeveral
    -1.04
     Reſ
    -1.04
     ſever
    -1.04
    POSITIVE LOGITS
    0.58
    ,
    0.58
     that
    0.53
     as
    0.47
     on
    0.47
     to
    0.45
     (
    0.44
    D
    0.44
     e
    0.43
     .
    0.43
    Act Density 1.537%

    No Known Activations