INDEX
Explanations
indicators of uncertainty or skepticism regarding outcomes
New Auto-Interp
Negative Logits
ri
-0.47
«
-0.47
motic
-0.41
"
-0.41
'
-0.40
():
-0.39
врат
-0.39
(!__
-0.39
entre
-0.39
-0.39
POSITIVE LOGITS
entanto
2.00
however
1.97
however
1.87
però
1.78
όμως
1.58
toutefois
1.52
cependant
1.52
tuttavia
1.50
însă
1.50
However
1.49
Activations Density 0.562%