    Explanations

    colons and structural elements in code or text formatting

    Negative Logits
    "])           -0.72
    "]))          -0.70
    .”)           -0.69
    }`)           -0.69
    Cyfarwyddwr   -0.68
    ]")           -0.66
    ]."           -0.66
    )});          -0.66
    >>()          -0.66
    ())));        -0.65
    Positive Logits
     __('         0.97
    [:            0.93
     _('          0.91
    (&:           0.89
    !(:           0.86
     nameof       0.83
    (__('         0.81
     &___         0.81
    (:            0.81
    nameof        0.80
    Act Density 0.051%

    No Known Activations