INDEX
    Explanations

    coded annotations or documentation within programming code

    New Auto-Interp
    Negative Logits
    FunctionFlags
    -0.46
    Accumulator
    -0.39
    canestro
    -0.38
    tgz
    -0.38
     inggris
    -0.37
    RegressionTest
    -0.37
     erkannt
    -0.35
     péld
    -0.35
     Reng
    -0.35
    defn
    -0.35
    POSITIVE LOGITS
     clothing
    0.56
     None
    0.56
     Clothing
    0.55
     lifestyle
    0.54
    Clothing
    0.54
    DockStyle
    0.53
    None
    0.52
    clothing
    0.52
    :✨
    0.51
     none
    0.49
    Act Density 0.094%

    No Known Activations