INDEX
    Explanations

    formatting elements and structural markers within the text

    New Auto-Interp
    Negative Logits
     ++
    -0.45
    <eos>
    -0.45
    ได้
    -0.43
    while
    -0.43
     fine
    -0.43
    rrggbb
    -0.42
     vrienden
    -0.42
    ✭✭
    -0.42
    é
    -0.41
     szczegó
    -0.41
    POSITIVE LOGITS
     صوتيه
    0.71
    findpost
    0.66
     AssemblyTitle
    0.64
    مصادر
    0.62
    tagHelperRunner
    0.62
     للمعارف
    0.61
    StoreMessageInfo
    0.61
    bibinfo
    0.60
     стаття
    0.59
     विश्वसनीयता
    0.59
    Act Density 0.019%

    No Known Activations