INDEX
Explanations
structure and metadata in code, particularly related to definitions and configurations
New Auto-Interp
Negative Logits
'';
-0.69
='';
-0.65
"";
-0.62
'',
-0.61
"");
-0.58
="";
-0.58
('');
-0.57
''
-0.57
("");
-0.55
"")
-0.55
POSITIVE LOGITS
['
1.17
["
1.16
=["
0.99
=['
0.97
(['
0.95
(["
0.94
["
0.94
":["
0.89
['
0.89
[]
0.88
Activations Density 0.146%