INDEX
    Explanations

    terms related to sexual harassment

    Negative Logits
     AspNetCore   -0.91
    abestanden    -0.91
     BorderSide   -0.91
     sumpay       -0.88
     SEDS         -0.88
     оригіналу    -0.87
     héro         -0.87
    IUrlHelper    -0.87
     ſche         -0.86
     Reſ          -0.85
    Positive Logits
     Har          1.28
    Har           1.27
     HAR          1.16
    har           1.13
     har          1.11
     Harman       1.09
     Harlow       1.02
    HAR           0.99
     Harrell      0.96
     Harrington   0.94
    Act Density 0.024%

    No Known Activations