INDEX
    Explanations

    phrases indicating criticism or concern about social issues and injustices

    New Auto-Interp
    Negative Logits
    Verdana
    -0.08
    ILA
    -0.07
    .appspot
    -0.07
    ouns
    -0.07
    μÏĨ
    -0.07
    acter
    -0.07
    ISCO
    -0.07
    ila
    -0.07
    prung
    -0.06
    _initializer
    -0.06
    POSITIVE LOGITS
    å¦ĤæŃ¤
    0.09
    竣
    0.07
     such
    0.07
     regress
    0.07
     so
    0.07
    akit
    0.07
     while
    0.06
     modern
    0.06
     basic
    0.06
     grown
    0.06
    Act Density 0.024%

    No Known Activations