INDEX
    Explanations

    attributes or settings related to UI components

    New Auto-Interp
    Negative Logits
    "):↵
    -0.18
    ))):↵
    -0.18
    '):↵
    -0.18
    ])):↵
    -0.17
    ']):↵
    -0.17
    ':↵
    -0.17
    ')):↵
    -0.16
    '):
    -0.16
    ":↵
    -0.15
    ']:↵
    -0.15
    POSITIVE LOGITS
    "/>↵
    0.47
     "/
    0.41
    }/>↵
    0.40
    '/>↵
    0.39
    "/>
    0.39
     />↵
    0.39
    "/>↵↵
    0.38
    />↵
    0.35
    }/
    0.35
    '/
    0.34
    Act Density 0.071%

    No Known Activations