INDEX
    Explanations

    references to the treatment and well-being of individuals, particularly marginalized groups

    New Auto-Interp
    Negative Logits
    脚注の使い方
    -0.64
    ScopeManager
    -0.56
     uLocal
    -0.49
    RTSC
    -0.47
     ComVisible
    -0.42
    estacks
    -0.41
     ویکی‌پدیا
    -0.41
    󠁣
    -0.41
    TokenNameDOT
    -0.40
    Availability
    -0.40
    POSITIVE LOGITS
     supported
    1.14
     cared
    1.05
     treated
    0.95
     served
    0.85
     assisted
    0.84
     catered
    0.82
    supported
    0.81
     protected
    0.81
     serviced
    0.79
     attended
    0.78
    Act Density 0.496%

    No Known Activations