INDEX
Explanations
themes of moral and ethical considerations, particularly related to treatment and respect for individuals
New Auto-Interp
Negative Logits
abbond
-0.47
ease
-0.46
tropicales
-0.45
Fiche
-0.45
menghi
-0.44
อัน
-0.44
duly
-0.42
duidelijk
-0.42
pertinentes
-0.42
Meksiku
-0.41
POSITIVE LOGITS
differently
1.49
autrement
0.94
diffé
0.90
Biôgrafia
0.84
anders
0.84
Differ
0.81
like
0.80
цездатний
0.80
similarly
0.77
wrong
0.76
Activations Density 0.590%