INDEX
Explanations
phrases indicating restrictions, limitations, or disclaimers
New Auto-Interp
Negative Logits
câteva
-0.64
sometime
-0.63
almeno
-0.60
Slightly
-0.60
slightly
-0.59
SOME
-0.58
slightly
-0.58
MessageTagHelper
-0.58
somewhat
-0.58
puțin
-0.57
POSITIVE LOGITS
yet
0.76
yet
0.74
YET
0.64
Yet
0.62
Yet
0.62
necessarily
0.59
harmed
0.59
contentLoaded
0.58
necessarily
0.56
exceed
0.56
Activations Density 0.817%