INDEX
Explanations
expressions of uncertainty and requests for guidance
seeking solutions to problems
Negative Logits
Portály          -0.63
esModule         -0.61
Wikimedijinoj    -0.59
برانيه           -0.56
ModelExpression  -0.55
extAlignment     -0.55
الرياضيه         -0.55
Manbalar         -0.54
Архівовано       -0.54
//});            -0.53
Positive Logits
incl             0.41
incl             0.39
ونج              0.35
stag             0.35
도               0.35
uchen            0.34
ugh              0.34
ila              0.33
مط               0.32
plain            0.32
Activations Density 0.070%