INDEX
Explanations
questions and expressions of doubt or inquiry
New Auto-Interp
Negative Logits
unfortunately
-0.76
malheureusement
-0.72
purtroppo
-0.62
sadly
-0.61
regrets
-0.60
сожалению
-0.58
niestety
-0.58
unfortunately
-0.57
Unfortunately
-0.57
fortunately
-0.56
POSITIVE LOGITS
Shouldn
1.77
Isn
1.68
Shouldn
1.67
Isn
1.63
shouldn
1.58
isn
1.57
Wouldn
1.54
Wouldn
1.54
Aren
1.52
Aren
1.48
Activations Density 0.248%