INDEX
    Explanations

    punctuation marks and certain short-format text features

    New Auto-Interp
    Negative Logits
    ”]
    -0.66
    <bos>
    -0.64
     ?'
    -0.61
    "}>
    -0.57
    localctx
    -0.55
    thâu
    -0.55
    ”“
    -0.55
    .”)
    -0.54
    "")
    -0.52
    '}>
    -0.52
    POSITIVE LOGITS
     détru
    0.90
     morire
    0.87
     définiti
    0.79
     menac
    0.79
     détruit
    0.78
     danni
    0.78
     chré
    0.77
     quæ
    0.77
     espirituales
    0.76
     ennemis
    0.76
    Act Density 1.050%

    No Known Activations