INDEX
Explanations
comparative phrases related to expectations
New Auto-Interp
Negative Logits
secundario
-0.49
Roskov
-0.45
wondering
-0.41
extranjero
-0.41
ADDED
-0.41
plegable
-0.40
essentiel
-0.40
nocturno
-0.40
added
-0.39
obtenido
-0.39
POSITIVE LOGITS
ValueStyle
0.65
imagined
0.60
bargain
0.59
IsContent
0.59
hoped
0.59
OGND
0.58
الرياضيه
0.58
anticipated
0.56
ModelExpression
0.52
imagined
0.52
Activations Density 0.286%