INDEX
Explanations
repetitive patterns or mentions of specific functions and variables in code, particularly related to logging and traceability
New Auto-Interp
Negative Logits
"
-0.87
“
-0.81
<eos>
-0.65
↵↵
-0.60
at
-0.60
(
-0.58
'
-0.52
(
-0.51
“
-0.51
[
-0.50
POSITIVE LOGITS
متعلقه
1.70
iſt
1.09
المعيارى
1.06
Efq
1.05
ReusableCell
1.04
وتسجيلات
1.04
createState
1.03
שוליים
1.02
مشين
1.01
houſe
0.99
Activations Density 0.471%