INDEX
    Explanations

    actions and descriptors

    New Auto-Interp
    Negative Logits
     वर्गी
    0.41
     revocation
    0.38
    姿勢
    0.37
     మార
    0.36
    0.35
     quitting
    0.35
     قاب
    0.34
    0.34
     防止
    0.33
     greeting
    0.33
    POSITIVE LOGITS
     mentally
    0.44
     physically
    0.43
     liberate
    0.41
     regra
    0.41
     put
    0.38
     gently
    0.38
     somehow
    0.37
     rendere
    0.37
     lovingly
    0.37
     artificially
    0.36
    Act Density 0.220%

    No Known Activations