    Explanations

    phrases indicating comparisons or references

    Negative Logits
     صوتيه
    -0.59
    AndEndTag
    -0.45
    slidesPer
    -0.43
     CreateTagHelper
    -0.43
     courtesy
    -0.43
    -0.42
    的神
    -0.41
    лон
    -0.40
     beh
    -0.40
     taip
    -0.40
    Positive Logits
    findpost
    0.92
    //-->
    0.75
    äsident
    0.74
     ostavi
    0.70
    autaire
    0.67
     otomatig
    0.67
    قایناق‌لار
    0.66
    ')")
    0.64
    uteen
    0.64
    //});
    0.63
    Act Density 0.225%

    No Known Activations