INDEX
    Explanations

    punctuation marks at the end of sentences and questions

    New Auto-Interp
    Negative Logits
    PYX
    -0.74
    Спољашње
    -0.61
    ніципалі
    -0.61
     Савезне
    -0.61
    хьтан
    -0.59
    GraphicsUnit
    -0.58
     Trinit
    -0.57
     ویکی‌پدیا
    -0.57
    -0.55
    شهاد
    -0.55
    POSITIVE LOGITS
    ↵↵
    1.48
    0.95
    </h3>
    0.79
    ↵↵↵
    0.78
    </strong>
    0.72
    </h4>
    0.69
    </blockquote>
    0.67
    </h2>
    0.62
    //};
    0.60
    </h5>
    0.59
    Act Density 0.559%

    No Known Activations