INDEX
    Explanations

    special characters and punctuation marks in the text

    Code or mathematical expressions

    academic citations and formatting

    New Auto-Interp
    Negative Logits
     {},
    
    -0.92
     [],
    
    -0.87
    /**
    
    
    -0.87
    .",
    
    -0.86
    NUMX
    -0.85
     '',
    
    -0.85
    `,
    
    -0.78
    ',
    
    
    -0.77
    (),
    
    -0.76
     Мексичка
    -0.76
    POSITIVE LOGITS
    ↵↵
    1.80
    1.40
    ↵↵↵
    1.19
    ↵↵↵↵
    1.01
    <eos>
    0.95
    ↵↵↵↵↵
    0.84
    ↵↵↵↵↵↵
    0.74
    </blockquote>
    0.73
    </h3>
    0.69
    </strong>
    0.68
    Act Density 0.418%

    No Known Activations