INDEX
Explanations
phrases that highlight observations or notes about information
New Auto-Interp
Negative Logits
AsUp
-0.88
)";
-0.76
kloped
-0.74
')";
-0.71
ftagPool
-0.70
initComponents
-0.68
"]);
-0.67
"){
-0.67
%")
-0.66
')"
-0.66
POSITIVE LOGITS
Note
0.84
Note
0.83
note
0.71
NOTE
0.70
Remember
0.70
remember
0.69
Remember
0.68
note
0.64
NOTE
0.64
REMEMBER
0.63
Activations Density 0.187%