INDEX
    Explanations

    phrases related to self-reflection and personal responsibility

    New Auto-Interp
    Negative Logits
    :✨
    -0.57
     ALONE
    -0.55
    #+#
    -0.54
     Alone
    -0.52
     
    -0.50
    AppMethodBeat
    -0.49
     methodName
    -0.49
    Alone
    -0.49
    alone
    -0.48
    出版年
    -0.48
    POSITIVE LOGITS
    IntoConstraints
    0.52
     mys
    0.45
     себя
    0.43
     otomatig
    0.41
    脚注の使い方
    0.40
    Disliked
    0.40
    σθαι
    0.40
    Flows
    0.39
     się
    0.38
    ตัว
    0.38
    Act Density 0.170%

    No Known Activations