INDEX
    Explanations

    expressions of personal growth and self-acceptance

    New Auto-Interp
    Negative Logits
     yourselves
    -0.67
    私も
    -0.63
    僕も
    -0.63
    ご注意
    -0.61
     we
    -0.59
     silahkan
    -0.57
    彼らは
    -0.54
     themselves
    -0.54
    themselves
    -0.53
    herself
    -0.53
    POSITIVE LOGITS
     Lately
    1.23
     lately
    1.19
     Recently
    0.98
    Recently
    0.95
     recently
    0.93
    recently
    0.89
     whenever
    0.85
     everytime
    0.84
     Whenever
    0.84
     Occasionally
    0.83
    Act Density 0.336%

    No Known Activations