INDEX
Explanations
phrases related to evaluations or assessments with optimistic or favorable connotations
New Auto-Interp
Negative Logits
[](
-0.57
Autres
-0.57
httphttps
-0.57
étrangère
-0.56
chrétienne
-0.56
interopRequire
-0.54
OwnProperty
-0.54
préparé
-0.53
intenant
-0.52
religieuse
-0.52
POSITIVE LOGITS
worst
0.77
OMITBAD
0.65
worst
0.65
scenario
0.64
kiệm
0.64
Scenario
0.63
Scenario
0.61
мум
0.59
minimum
0.59
MINIMUM
0.58
Activations Density 0.479%