INDEX
    Explanations

    references to reality television and media personalities

    New Auto-Interp
    Negative Logits
     CreateTagHelper
    -0.60
    AsUp
    -0.52
     مواليد
    -0.49
     مشين
    -0.46
    enterOuterAlt
    -0.44
    SBATCH
    -0.42
    jahteraan
    -0.42
    Autoritní
    -0.41
    extAlignment
    -0.40
    fjspx
    -0.39
    POSITIVE LOGITS
     show
    2.23
     shows
    1.85
    show
    1.73
    shows
    1.50
     Show
    1.49
     Shows
    1.46
     шоу
    1.41
     SHOW
    1.38
    Shows
    1.38
    Show
    1.32
    Act Density 0.418%

    No Known Activations