INDEX
    Explanations

    phrases related to downplaying or playing down something

    variations of the verb "play" and its derivatives

    New Auto-Interp
    Negative Logits
    spot
    -0.87
     Creat
    -0.71
    vic
    -0.69
    ni
    -0.68
    ust
    -0.66
    pour
    -0.65
    ney
    -0.64
    iph
    -0.64
    aghan
    -0.63
    ci
    -0.63
    POSITIVE LOGITS
    enance
    0.75
    theless
    0.71
    ements
    0.71
     overlook
    0.68
    uate
    0.67
     innocence
    0.66
    Mask
    0.65
    hee
    0.65
    INTON
    0.65
    rarily
    0.63
    Act Density 0.030%

    No Known Activations