INDEX
    Explanations

    elements related to formatting and style in texts, particularly bold and italics

    New Auto-Interp
    Negative Logits
    rary
    -0.17
    ession
    -0.15
    elez
    -0.15
    ehir
    -0.15
    usk
    -0.14
    onomous
    -0.14
    aniel
    -0.14
    .lu
    -0.14
    igue
    -0.14
    enberg
    -0.14
    POSITIVE LOGITS
     bold
    0.33
    italic
    0.30
     Bold
    0.29
    bold
    0.29
     italic
    0.28
    Italic
    0.28
     Ital
    0.27
    -bold
    0.26
    Bold
    0.26
     underline
    0.25
    Act Density 0.048%

    No Known Activations