INDEX
    Explanations

    terms related to various forms of societal evaluations or judgments

    New Auto-Interp
    Negative Logits
     Poco
    -0.59
    انيف
    -0.58
     ***!
    -0.56
    ้อย
    -0.56
    knapp
    -0.54
    ]),
    
    -0.54
    saida
    -0.53
    Diweddarwch
    -0.52
    Notes
    -0.51
    delo
    -0.50
    POSITIVE LOGITS
     financially
    1.52
     socially
    1.43
     physically
    1.43
     politically
    1.42
     technologically
    1.41
     economically
    1.41
     psychologically
    1.39
     morally
    1.38
     Physically
    1.37
     biologically
    1.36
    Act Density 0.235%

    No Known Activations