    Explanations

    references to moral justifications for actions and the consequences of those actions

    Negative Logits
    Token               Logit
    " ProtoMessage"     -0.73
    "CppMethod"         -0.72
    " informée"         -0.72
    " utafitiHapana"    -0.70
    " sumpay"           -0.68
    "adaptiveStyles"    -0.68
    "MessageOf"         -0.67
    " EconPapers"       -0.67
    "Tikang"            -0.66
    "httphttps"         -0.65
    Positive Logits
    Token               Logit
    "明明"              0.34
    " sobretudo"        0.34
    " inevitably"       0.33
    "InputTagHelper"    0.32
    " vectorielle"      0.32
    " når"              0.30
    " mennesker"        0.30
    "topRight"          0.30
    " humaine"          0.30
    " éprou"            0.30
    Activation Density: 3.482%

    No Known Activations