    Explanations

    code readability and indentation

    Negative Logits
    " accuracies"     0.41
    " empowerment"    0.40
                      0.38
    " succes"         0.38
    " اشاره"          0.38
    " safest"         0.37
    " garantías"      0.37
    "راضي"            0.37
    " fittest"        0.37
    " confiança"      0.37

    Positive Logits
    " indentation"    1.45
    " spacing"        1.29
    " indent"         1.29
    "indent"          1.23
    " readability"    1.20
    "Indent"          1.16
    "Spacing"         1.15
    " readable"       1.13
    " indented"       1.13
    "spacing"         1.09
    Activation Density: 0.071%

    No Known Activations