INDEX
    Explanations

    words related to giving credit or acknowledgment

    New Auto-Interp
    Negative Logits
    SPONSORED
    -0.81
    fab
    -0.74
    çİĭ
    -0.71
    STON
    -0.67
    ichick
    -0.66
     Osw
    -0.65
    ews
    -0.65
    ools
    -0.65
    dose
    -0.63
    HF
    -0.63
    POSITIVE LOGITS
     representation
    0.62
     accuracy
    0.60
     equal
    0.60
     comparable
    0.60
     Dying
    0.57
     strictly
    0.57
     understatement
    0.56
     outnumbered
    0.56
     reviewer
    0.56
     bra
    0.55
    Act Density 0.106%

    No Known Activations