INDEX
    Explanations

    markers of structured sections, especially numbered list items and outline-style headings.

    New Auto-Interp
    Negative Logits
    Textures
    0.46
    0.45
     منح
    0.43
     autonomic
    0.43
     округу
    0.43
    0.42
     treino
    0.42
     లా
    0.41
     поза
    0.41
     thorac
    0.41
    POSITIVE LOGITS
     grandfather
    0.42
    Saint
    0.42
    poison
    0.40
     sulfates
    0.40
     сад
    0.38
    grandfather
    0.38
     করিয়৷
    0.38
     onstage
    0.38
    いただく
    0.38
    Bj
    0.38
    Act Density 0.001%

    No Known Activations