INDEX
Explanations
descriptive labels in tables
New Auto-Interp
Negative Logits
!).
0.75
!;
0.72
!);
0.71
;
0.70
’!
0.70
!<
0.69
!');
0.68
!";
0.67
!';
0.67
!".
0.67
POSITIVE LOGITS
1.01
1.00
1.00
0.97
0.97
0.94
0.94
0.94
0.93
0.92
Activations Density 0.271%