INDEX
Explanations
occurrences of the character 'Y' or variables starting with 'Y'
Negative Logits
)");    -1.08
"");    -1.07
')")    -0.99
]');    -0.91
)";     -0.89
'));    -0.89
/');    -0.89
')],    -0.87
>');    -0.86
).]     -0.86
Positive Logits
Y       1.87
Y       1.48
y       1.34
getY    1.31
Yel     1.21
Yag     1.14
Yvette  1.06
Yud     1.06
getY    1.04
yolk    1.02
Activations Density: 0.085%