INDEX
Explanations
discussions about influence, representation, and standards in societal or organizational contexts
Negative Logits
enough          -0.15
-of             -0.15
orf             -0.14
ThemeProvider   -0.14
uyết            -0.13
ateria          -0.13
.GraphicsUnit   -0.13
CHandle         -0.13
otre            -0.13
errick          -0.13
Positive Logits
than    0.40
-than   0.38
than    0.38
_than   0.31
niż     0.29
Than    0.29
THAN    0.28
Than    0.26
než     0.23
_THAN   0.21
Activations Density 0.319%