INDEX
Explanations
phrases that express doubt or uncertainty
unexpected outcomes
Negative Logits
awsze            -0.49
Населення        -0.48
صوتيه            -0.44
AndEndTag        -0.43
resourceCulture  -0.43
prestazioni      -0.43
statechange      -0.41
GenerationType   -0.40
azioni           -0.40
CrossRef         -0.40
Positive Logits
siquiera   0.68
even       0.66
even       0.57
даже       0.56
навіть     0.56
EVEN       0.55
überhaupt  0.54
Even       0.54
すら       0.53
дори       0.53
Activations Density 0.186%