INDEX
    Explanations

    numeric identifiers and related information

    New Auto-Interp
    Negative Logits
    ítulo
    -0.56
     later
    -0.53
    <eos>
    -0.52
    celli
    -0.49
     Dal
    -0.49
     gepubliceerd
    -0.49
     برانيه
    -0.48
    كتشاف
    -0.45
    érables
    -0.45
     Later
    -0.45
    POSITIVE LOGITS
     Efq
    0.81
     <>",
    0.74
     Shakspeare
    0.70
     createState
    0.67
    awtextra
    0.66
     pleaſure
    0.65
     Савезне
    0.65
     Houſe
    0.64
     Majefty
    0.63
     myſelf
    0.63
    Act Density 0.091%

    No Known Activations