INDEX
    Explanations

    references to different ethnic groups or nationalities

    Negative Logits
    "($__"           -0.65
    " my"            -0.64
    " NSCoder"       -0.63
    "my"             -0.62
    "///</"          -0.61
    "richtet"        -0.57
    "sidemargin"     -0.57
    "pe"             -0.56
    "tro"            -0.56
    "TagHelpers"     -0.55
    Positive Logits
    " myſelf"            0.92
    " themſelves"        0.91
    "neſs"               0.90
    " itſelf"            0.88
    "ſelf"               0.85
    " himſelf"           0.85
    " raiſ"              0.85
    " Chrif"             0.79
    "存于互联网档案馆"   0.79
    " faſt"              0.79
    Activation Density: 0.071%

    No Known Activations