INDEX
Explanations
phrases related to legal, ethical, philosophical, and societal concepts
concepts related to arguments and beliefs about morality and self-interest
Negative Logits
Token            Logit
racks            -0.51
Ahead            -0.48
çīĪ              -0.46
tops             -0.45
Boom             -0.44
Shots            -0.43
Drops            -0.43
Shooting         -0.42
Lights           -0.42
DAQ              -0.41
Positive Logits
Token            Logit
epist            0.51
caus             0.51
ensable          0.51
blance           0.50
morally          0.50
psychologically  0.49
ardless          0.49
subjective       0.49
causation        0.48
akespe           0.47
Activations Density 4.349%
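
The page does not define how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature's activation is above zero (a common reading for this kind of dashboard, but an assumption here). The function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all; the source page does not state its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical activations for 8 token positions (not data from the page).
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.0, 0.0]
print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 4.349% would mean the feature is active on roughly 1 in 23 sampled tokens.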