INDEX
    Explanations

    text related to ethical standards, morality, and breaches of ethics

    New Auto-Interp
    Negative Logits
     sebastian
    -0.62
     halle
    -0.56
     vinyle
    -0.55
     suga
    -0.55
     parma
    -0.55
     claudia
    -0.55
    cupa
    -0.54
     luis
    -0.54
     Molière
    -0.54
     Mlle
    -0.53
    POSITIVE LOGITS
     ethics
    1.44
     Ethics
    1.32
     ethical
    1.26
    Ethics
    1.26
    ethics
    1.16
     Ethical
    1.09
    ethical
    1.09
    Ethical
    1.06
     ethically
    1.04
     ethic
    0.98
    Act Density 0.061%

    No Known Activations