INDEX
    Explanations

    terms and concepts related to ethics and morality

    New Auto-Interp
    Negative Logits
     stroke
    -0.52
     inacces
    -0.51
     excitedly
    -0.51
     Haddad
    -0.49
    }),
    
    -0.49
    VH
    -0.48
     Stedman
    -0.48
    -0.48
    Identyfik
    -0.48
     definitive
    -0.47
    POSITIVE LOGITS
     moral
    1.08
     Moral
    1.06
    Moral
    0.99
     ethics
    0.98
     ethical
    0.97
     morals
    0.96
    moral
    0.95
    Ethics
    0.93
     morality
    0.93
     Ethics
    0.91
    Act Density 0.348%

    No Known Activations