INDEX
    Explanations

    punctuation marks and symbols

    Negative Logits
    " McCorm"   -0.17
    "ett"       -0.16
    "ipar"      -0.15
    "ies"       -0.15
    " Mall"     -0.15
    "rouw"      -0.14
    " PIL"      -0.14
    "esel"      -0.14
    "LECT"      -0.14
    " loop"     -0.14
    Positive Logits
    "public"    0.23
    " class"    0.22
    "class"     0.22
    " public"   0.19
    "ami"       0.16
    "atore"     0.16
    ":class"    0.15
    " sınıf"    0.15
    "aclass"    0.15
    "\tclass"   0.15
    Act Density 0.012%

    No Known Activations