INDEX
    Explanations

    punctuation and separators commonly used in code or data formats

    NEGATIVE LOGITS
    })),
    -0.69
    ']),
    -0.68
    ]');
    -0.64
    '],
    -0.63
    >>,
    -0.61
    ']:
    -0.60
    }');
    -0.59
    )');
    -0.59
    }),
    -0.59
    "],
    -0.58
    POSITIVE LOGITS
     "
    1.13
    <<"
    1.11
    -"
    1.09
    +"
    1.04
     +"
    1.00
    []{"
    0.92
     -"
    0.92
    .-"
    0.89
    +="
    0.88
    ]+"
    0.88
    Activation Density 0.142%

    No Known Activations