INDEX
    Explanations

    instances of formatting or structural elements within the text

    Negative Logits
    principalTable       -0.82
    InvalidProtocol      -0.75
    OGND                 -0.74
    abestanden           -0.60
     Roskov              -0.57
    tagHelperRunner      -0.55
     виправивши          -0.54
    (token not shown)    -0.54
    Бахар                -0.53
    mehl                 -0.53
    Positive Logits
    </em>                0.71
    ConstraintMaker      0.71
    _\n                  0.70
     Савезне             0.69
     Мексичка            0.67
    "):\n                0.65
    "]);\n               0.64
    </i>                 0.64
    ?\n                  0.63
     وتسجيلات            0.62
    Activation Density: 0.720%

    No Known Activations