INDEX
    Explanations

    punctuation and formatting cues related to data presentation

    Negative Logits
    OGND
    -0.52
    RTSC
    -0.51
    Gön
    -0.51
     aapt
    -0.44
    Chham
    -0.42
    raisals
    -0.40
    DrawerToggle
    -0.40
    Diweddarwch
    -0.39
    apter
    -0.39
    alse
    -0.39
    Positive Logits
    SequentialGroup
    0.48
     Wikiseite
    0.46
    Sziasztok
    0.43
     imprimée
    0.40
    0.39
    ioterapia
    0.38
     fjor
    0.38
    
    0.38
    Notae
    0.38
     biały
    0.38
    Activation Density: 0.025%

    No Known Activations