INDEX
    Explanations

    references to modules or modular structures in a coding context

    New Auto-Interp
    Negative Logits
     Braw
    -0.83
     Bellow
    -0.74
     Justine
    -0.71
     GLFW
    -0.71
     Daria
    -0.70
     diffe
    -0.69
     Krus
    -0.69
    consulté
    -0.68
     Belo
    -0.68
     belie
    -0.67
    POSITIVE LOGITS
     modules
    1.87
     Modules
    1.72
     module
    1.69
     Module
    1.62
    Modules
    1.61
    modules
    1.56
     MODULE
    1.53
    MODULE
    1.52
    Module
    1.51
    module
    1.49
    Act Density 0.080%

    No Known Activations