INDEX
Explanations
questions and statements about information sharing and decision-making
preceding questions implying uncertainty
who and what questions
New Auto-Interp
Negative Logits
也很
-0.72
also
-0.69
also
-0.68
也非常
-0.66
también
-0.60
very
-0.60
Очень
-0.60
ook
-0.59
очень
-0.58
almost
-0.57
POSITIVE LOGITS
réellement
0.75
surla
0.74
poszczegól
0.74
บ้าง
0.71
ACTUALLY
0.71
realistically
0.71
fakty
0.70
realmente
0.70
best
0.69
ulike
0.69
Activations Density 0.360%