Explanations
references to questions and questionnaire contexts
Negative Logits
mijne      -0.93
myſelf     -0.85
$_"        -0.85
himſelf    -0.84
elfare     -0.82
lepiej     -0.82
utuhkan    -0.82
gloire     -0.81
(;;)       -0.81
poffible   -0.81
Positive Logits
questions   2.11
question    2.01
Question    1.92
question    1.81
Questions   1.80
Questions   1.73
questions   1.71
QUESTION    1.71
Question    1.71
QUESTION    1.51
Activations Density 0.037%