INDEX
Explanations
numerical values and mathematical expressions
Letters in multiple choice answers
New Auto-Interp
Negative Logits
invité
-0.42
singoli
-0.41
coût
-0.40
Lump
-0.40
agio
-0.39
peur
-0.38
lisäksi
-0.38
Buen
-0.38
judiciales
-0.38
rús
-0.37
POSITIVE LOGITS
None
1.21
none
1.18
None
1.13
correct
1.03
Correct
1.02
none
0.99
option
0.97
options
0.96
Correct
0.96
NONE
0.95
Activations Density 0.629%