INDEX
Explanations
terms related to decision-making and accountability in various contexts
New Auto-Interp
Negative Logits
typelib
-0.52
<<<<<<<<<<<<<<
-0.52
ArrowToggle
-0.51
unknownFields
-0.48
RenderAtEndOf
-0.47
كومونز
-0.46
հղումներ
-0.43
NameInMap
-0.43
SharedCtor
-0.43
дописавши
-0.42
POSITIVE LOGITS
s
1.20
ſhe
0.87
그녀
0.85
she
0.84
她們
0.83
herself
0.82
herself
0.79
她们
0.75
her
0.75
她
0.71
Activations Density 0.493%