INDEX
Explanations
variable assignments and connections
New Auto-Interp
Negative Logits
}}}=
0.85
};
0.81
}});
0.78
)})$
0.73
}})
0.73
}=
0.71
))}
0.71
}))
0.70
)})
0.70
}')
0.70
POSITIVE LOGITS
::
1.00
::
0.95
:-
0.81
<-
0.79
.-
0.74
->
0.72
>>
0.71
<-
0.70
፡
0.68
<<
0.66
Activations Density 0.240%