INDEX
Explanations
expressions related to moral evaluations and legality
New Auto-Interp
Negative Logits
given
-0.18
Ñħи
-0.18
istrovstvÃŃ
-0.17
Given
-0.16
given
-0.16
Considering
-0.16
><?
-0.16
eldom
-0.16
VF
-0.15
considering
-0.15
POSITIVE LOGITS
because
0.38
BE
0.36
precisely
0.33
because
0.31
Because
0.31
porque
0.31
prec
0.31
Because
0.30
поÑĤомÑĥ
0.29
Prec
0.28
Activations Density 0.147%