INDEX
    Explanations

    references to respect and honor in social interactions

    New Auto-Interp
    Negative Logits
    PreferredItem
    -0.72
    +:+
    -0.72
     NgModule
    -0.71
    RectangleBorder
    -0.67
    feira
    -0.61
    AutoScaleMode
    -0.61
     ویکی‌پدیا
    -0.60
    fillType
    -0.59
    BoxShadow
    -0.57
    styleType
    -0.57
    POSITIVE LOGITS
     gentlemen
    1.00
     gentleman
    0.87
     Gentlemen
    0.79
    Gentlemen
    0.77
    0.72
    ท่าน
    0.70
    gentle
    0.65
     kindly
    0.65
    さん
    0.60
     folks
    0.57
    Act Density 0.203%

    No Known Activations