INDEX
    Explanations

    self-esteem and self-respect

    New Auto-Interp
    Negative Logits
     Of
    -2.39
     the
    -2.20
     Then
    -2.11
     deemed
    -2.09
     They
    -2.09
     員
    -2.00
    ギフト
    -2.00
    -1.98
    もありました
    -1.97
    -1.94
    POSITIVE LOGITS
    1.97
    1.95
    1.95
    1.84
    1.80
    1.80
    {
    1.72
    ﹍﹍
    1.70
    1.70
    1.69
    Act Density 0.003%

    No Known Activations