INDEX
Explanations
comment and documentation markers in code
New Auto-Interp
Negative Logits
de
-0.79
đ
-0.78
-
-0.70
ness
-0.69
le
-0.69
gran
-0.68
erd
-0.67
of
-0.66
portato
-0.66
er
-0.66
POSITIVE LOGITS
)*/
1.68
})*/
1.54
.*/
1.43
();*/
1.42
};*/
1.42
*/
1.36
);*/
1.35
}*/
1.33
;*/
1.31
]-->
1.28
Activations Density 0.066%