    Explanations

    concepts related to treatment, behavior, and ethics in interpersonal and societal contexts

    Describes actions or behaviors, often negative

    how things are done or treated

    Negative Logits
    Token                   Logit
    " يتيمه"                -0.69
    (no token text)         -0.68
    "AntiForgeryToken"      -0.61
    "fillType"              -0.61
    "IndexPath"             -0.61
    "IsContent"             -0.59
    "IsMutable"             -0.56
    "Naissance"             -0.53
    "yorsunuz"              -0.53
    "ungsbedingungen"       -0.52
    Positive Logits
    Token                   Logit
    " differently"          1.46
    " accordingly"          0.94
    " according"            0.94
    " incorrectly"          0.93
    " diffé"                0.85
    "differ"                0.79
    " how"                  0.78
    " wrong"                0.77
    " correctly"            0.76
    " wrongly"              0.73
    Activation Density: 0.401%
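
    Activation density is commonly read as the fraction of token positions on which a feature is active. The sketch below assumes that definition. The function name `activation_density`, the zero threshold, and the sample activations are hypothetical and are not taken from this dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: a feature counts as "active" when its activation is
    strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, 4 of which are active.
sample_activations = [0.0] * 996 + [0.7, 1.2, 0.3, 2.5]
density = activation_density(sample_activations)
print(f"{density:.3%}")
```

    Under this reading, a density of 0.401% would mean the feature is active on roughly 4 of every 1000 token positions.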

    No Known Activations