INDEX
Explanations
occurrences of specific formatting or annotation patterns in code comments
Negative Logits
abile
-0.15
jn
-0.15
hic
-0.14
219
-0.14
ahoma
-0.14
ase
-0.13
kám
-0.13
wav
-0.13
Hell
-0.13
inent
-0.13
Positive Logits
@
0.18
@g
0.16
aint
0.15
version
0.15
/{{
0.14
ingleton
0.14
version
0.14
Roh
0.14
lich
0.13
Smy
0.13
Activations Density 0.005%