INDEX
    Explanations

    special formatting characters or symbols used in an encoded context

    ending in "self" or containing math symbols

    New Auto-Interp
    Negative Logits
     R
    -0.57
     "
    -0.56
    -0.54
     K
    -0.54
     (
    -0.54
     T
    -0.53
     V
    -0.53
     Pat
    -0.52
    ,
    -0.52
     E
    -0.52
    POSITIVE LOGITS
     myſelf
    1.11
     itſelf
    1.02
     للاسماء
    1.01
     themſelves
    1.00
     CreateTagHelper
    0.97
     Theſe
    0.96
     propOrder
    0.95
    ſelf
    0.92
     Efq
    0.91
     auffi
    0.90
    Act Density 0.001%

    No Known Activations