INDEX
    Explanations

    assessments of success or feedback regarding events or experiences

    New Auto-Interp
    Negative Logits
    };*/
    -0.60
    ]]
    
    -0.58
    ]";
    -0.53
    ='';
    
    -0.51
     maksi
    -0.50
    ]),
    
    -0.50
    ']);
    
    -0.49
    Przypisy
    -0.48
    '])
    
    -0.48
    )];
    
    -0.48
    POSITIVE LOGITS
     success
    0.77
    isSuccessful
    0.77
     Success
    0.74
     successful
    0.74
     sucesso
    0.70
     éxito
    0.70
     liked
    0.69
     SUCCESS
    0.68
     succès
    0.67
    好評
    0.67
    Act Density 0.243%

    No Known Activations