Explanations
phrases indicating actions or decisions that are expected or required in various contexts
Negative Logits
Token     Logit
ayi       -0.15
eut       -0.15
alex      -0.15
олева     -0.15
ushing    -0.15
ABEL      -0.14
antage    -0.14
Ïīνα      -0.14
ết        -0.14
uko       -0.14
Positive Logits
Token       Logit
religion    0.20
nutrition   0.17
.Toolkit    0.16
âĹĦ         0.15
being       0.15
education   0.15
wellness    0.15
<>          0.14
prevention  0.14
Religion    0.14
Activations Density: 0.406%