INDEX
Explanations
elements related to social justice and the consequences of actions
Negative Logits
due           -0.24
due           -0.24
Due           -0.23
istrovství    -0.21
Due           -0.21
owing         -0.20
given         -0.20
_due          -0.19
eldom         -0.17
хи            -0.17
Positive Logits
BE            0.36
-b            0.28
because       0.27
Prec          0.27
precisely     0.27
BE            0.26
prec          0.26
partially     0.25
-be           0.25
partly        0.24
Activations Density 0.160%