INDEX
Explanations
phrases related to benefits and positive outcomes
New Auto-Interp
Negative Logits
tiv
-0.19
allet
-0.17
eron
-0.15
-0.15
ern
-0.15
y
-0.15
keit
-0.14
itty
-0.14
-за
-0.14
eli
-0.14
POSITIVE LOGITS
fully
0.19
icial
0.17
ably
0.17
728
0.15
jer
0.15
inand
0.15
/***/
0.14
uD
0.14
ycastle
0.14
benefits
0.14
Activations Density 0.054%