INDEX
Explanations
terms related to various forms of societal evaluations or judgments
New Auto-Interp
Negative Logits
Poco
-0.59
انيف
-0.58
***!
-0.56
้อย
-0.56
knapp
-0.54
]),
-0.54
saida
-0.53
Diweddarwch
-0.52
Notes
-0.51
delo
-0.50
POSITIVE LOGITS
financially
1.52
socially
1.43
physically
1.43
politically
1.42
technologically
1.41
economically
1.41
psychologically
1.39
morally
1.38
Physically
1.37
biologically
1.36
Activations Density 0.235%