INDEX
Explanations
phrases that indicate mandatory actions or emphasize strong emotional expressions
New Auto-Interp
Negative Logits
تقاوى
-0.89
})).
-0.88
()]);
-0.87
."]
-0.85
]);
-0.79
."));
-0.77
"]);
-0.76
.');
-0.75
FieldBuilder
-0.75
.");
-0.74
POSITIVE LOGITS
\{\\0.82
di
0.56
“
0.53
win
0.51
mistic
0.51
ised
0.50
姆斯
0.50
“
0.49
Win
0.47
iastes
0.47
Activations Density 0.026%