INDEX
    Explanations

    punctuation and formatting elements in the text

    Negative Logits
     насељу
    -0.79
     autorytatywna
    -0.74
    +#+
    -0.72
    mergeFrom
    -0.71
    AlterField
    -0.68
    SizeMode
    -0.68
    tonsoft
    -0.67
    msgSender
    -0.66
    — 
    -0.65
    `,↵
    -0.65
    Positive Logits
    ↵↵↵
    0.72
    <eos>
    0.72
    ↵↵
    0.66
    ↵↵↵↵↵↵
    0.60
    ↵↵↵↵
    0.59
    ↵↵↵↵↵
    0.59
    ↵↵↵↵↵↵↵↵↵
    0.59
    ↵↵↵↵↵↵↵
    0.58
    0.54
     виправивши
    0.54
    Act Density 0.112%

    No Known Activations