INDEX
Explanations
phrases indicating potential outcomes and assessments of effectiveness
New Auto-Interp
Negative Logits
PreferredItem
-0.75
OGND
-0.72
میتوانید
-0.64
ThroughAttribute
-0.64
>(_
-0.57
کردند
-0.56
CURIAM
-0.56
BagLayout
-0.56
Answered
-0.55
communiquez
-0.55
POSITIVE LOGITS
be
3.60
be
1.87
Be
1.69
быть
1.59
Be
1.59
être
1.49
бути
1.48
BE
1.41
være
1.41
być
1.39
Activations Density 1.869%