INDEX
    Explanations

    formatted headings and sections within a document

    Text after specific punctuation or formatting marks

    New Auto-Interp
    Negative Logits
    AndEndTag
    -0.76
     المعيارى
    -0.63
     الرياضيه
    -0.60
    SequentialGroup
    -0.58
    niſſe
    -0.57
    iſchen
    -0.56
     vPvB
    -0.55
    تفصیلات
    -0.54
    RectangleBorder
    -0.54
     ſei
    -0.52
    POSITIVE LOGITS
    としての
    0.41
     Concerning
    0.40
     the
    0.40
    RegressionTest
    0.40
    Concerning
    0.37
    による
    0.37
    %%
    
    0.36
    การ
    0.36
    <h1>
    0.36
    <h2>
    0.36
    Act Density 0.049%

    No Known Activations