INDEX
Explanations
phrases indicating the concept of choice or decision-making
Negative Logits
Token             Logit
pitié             -0.86
câte              -0.81
scurt             -0.76
lagoons           -0.73
ไง                -0.72
băr               -0.72
                  -0.72
dermatologist     -0.71
longitudinally    -0.71
"}>               -0.71
Positive Logits
Token             Logit
choice            2.91
choices           2.71
choice            2.70
Choice            2.67
Choice            2.58
CHOICE            2.54
Choices           2.39
choices           2.27
CHOICE            2.24
Choices           2.21
Activations Density 0.047%
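
For orientation, a density figure like the one above is commonly read as the fraction of token positions on which the feature fires at all. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and are not taken from this page or from the tool that produced it.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature
    is active. The dashboard's exact definition is not given in the source.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 10,000 token positions, 5 with nonzero activation.
sample = [0.0] * 9995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)
print(f"{density:.3%}")  # 5 / 10,000 positions, printed as a percentage
```

Under this reading, a density of 0.047% would mean the feature is active on roughly 47 of every 100,000 token positions, which would fit a feature tied to a narrow set of tokens such as the "choice" variants listed above.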