    Negative Logits
     chromat            0.52
     decoding           0.46
     lineare            0.46
    黑色                 0.45
    を取得                0.45
                        0.43
     cyan               0.42
     mythical           0.42
     °                  0.41
     आक्रमण             0.41
    Positive Logits
     altru              1.20
     selfless           1.17
     philanthropic      1.10
     volunteering       1.09
     philanthrop        1.05
     philanthropist     1.00
     philanthropy       1.00
     compassion         0.93
     humanitarian       0.93
     Volunte            0.93
    Activation Density: 0.053%

    No Known Activations