INDEX
    Explanations

    occurrence of opening tags or markers in a structured format

    New Auto-Interp
    Negative Logits
    تقاوى
    -0.51
     esper
    -0.49
     الاطلاع
    -0.48
     horaire
    -0.46
    -0.43
     himself
    -0.43
     tri
    -0.43
    SuppressLint
    -0.42
     виправивши
    -0.42
     beginnetje
    -0.42
    POSITIVE LOGITS
     <=",
    0.87
     Catawiki
    0.74
    Rohy
    0.73
     ddelweddau
    0.67
     Wikimedijinoj
    0.63
     eds
    0.63
     alike
    0.60
    sowie
    0.60
    énario
    0.59
     respectively
    0.59
    Act Density 0.005%

    No Known Activations