INDEX
    Explanations

    terms that pertain to moral values and ethical considerations

    New Auto-Interp
    Negative Logits
     GeoNames
    -0.93
    -0.82
     يتيمه
    -0.80
     Walkover
    -0.75
    MLLoader
    -0.71
    })();
    
    -0.71
     CreateTagHelper
    -0.70
    FailureListener
    -0.69
     archipelago
    -0.69
    ]),
    
    -0.68
    POSITIVE LOGITS
     moral
    1.47
    Mor
    1.37
     morales
    1.34
     Moral
    1.33
    mor
    1.31
     mor
    1.30
    moral
    1.30
     Mor
    1.29
     morals
    1.27
     MOR
    1.27
    Act Density 0.112%

    No Known Activations