INDEX
    Explanations

    the opening of a text or a new section

    After capitalized words

    sequence specific manner

    New Auto-Interp
    Negative Logits
    onAttach
    -0.80
     myſelf
    -0.74
     henceforth
    -0.70
     Efq
    -0.70
     betweenstory
    -0.70
    raszam
    -0.69
     thereupon
    -0.69
     ویکی‌پدیا
    -0.67
    الحياه
    -0.65
    InjectAttribute
    -0.64
    POSITIVE LOGITS
    </h1>
    0.65
     }
    
    0.63
    </h4>
    0.63
    }{*}{
    0.63
    {...
    0.62
    ")]
    
    0.60
     ‘
    0.60
    )))
    
    0.60
    </h2>
    0.59
    ]))
    
    0.59
    Act Density 0.055%

    No Known Activations