INDEX
    Explanations

    function call patterns in programming code

    New Auto-Interp
    Negative Logits
    ")))
    -0.85
    "
    -0.84
     "
    
    -0.76
    riwal
    -0.72
    ...]
    -0.72
    "]
    
    -0.70
     zatem
    -0.69
     })}
    -0.68
    $"
    -0.68
     }
    -0.67
    POSITIVE LOGITS
    ($
    1.62
    (!__
    1.28
     ($
    1.24
    (($
    1.18
     (($
    1.16
     ($\
    1.15
    ($_
    1.12
    (_
    1.09
    ($__
    1.09
     (£
    1.07
    Act Density 0.086%

    No Known Activations