INDEX
Explanations
phrases related to moral and ethical solutions or judgments
New Auto-Interp
Negative Logits
.activ
-0.16
WR
-0.15
pson
-0.14
ansi
-0.14
mdir
-0.14
aupt
-0.14
åĭĻ
-0.14
éŀ
-0.14
errick
-0.14
Σα
-0.13
POSITIVE LOGITS
accordingly
0.27
correspond
0.27
corresponding
0.24
Correspond
0.20
Accordingly
0.18
corres
0.18
likewise
0.18
accompanying
0.17
ÑģооÑĤвеÑĤÑģÑĤв
0.17
correspondent
0.16
Activations Density 0.212%