INDEX
Explanations
affirmations related to the quality of experiences or products
New Auto-Interp
Negative Logits
fails
-0.15
689
-0.15
failed
-0.15
P
-0.14
-0.14
ür
-0.14
Fern
-0.14
Dear
-0.14
/
-0.14
fail
-0.14
POSITIVE LOGITS
chu
0.15
ppo
0.14
rganization
0.14
güncel
0.14
ruh
0.14
±Ð¾ÑĤ
0.14
Cousins
0.14
ignon
0.14
chy
0.13
ÑģобоÑİ
0.13
Activations Density 0.294%