INDEX
    Explanations

    coding-related instructions or operations

    New Auto-Interp
    Negative Logits
     оригіналу
    -0.69
    ImageContext
    -0.55
     استنادى
    -0.53
    ForRow
    -0.52
    lück
    -0.51
    OGND
    -0.49
    etheless
    -0.49
    تقاوى
    -0.48
     how
    -0.48
     noDo
    -0.48
    POSITIVE LOGITS
     o
    1.38
     os
    1.18
     um
    0.87
     as
    0.85
     seu
    0.85
     esse
    0.82
     essa
    0.80
     uma
    0.79
     esses
    0.76
     sua
    0.75
    Act Density 0.034%

    No Known Activations