INDEX
Explanations
references to core values and principles within an organization
New Auto-Interp
Negative Logits
bÃło
-0.16
åύ
-0.15
.opengl
-0.15
ÄIJT
-0.15
iks
-0.14
oogle
-0.14
олн
-0.14
olare
-0.14
asd
-0.14
물
-0.13
POSITIVE LOGITS
asha
0.16
principles
0.16
values
0.15
/values
0.15
Values
0.14
ameleon
0.14
_values
0.14
giy
0.14
believes
0.14
imum
0.13
Activations Density 0.098%