INDEX
    Explanations

    occurrences of the word "this"

    New Auto-Interp
    Negative Logits
    Personendaten
    -1.00
    ']")
    -0.94
     noDo
    -0.90
     étoient
    -0.84
     chofe
    -0.84
     Efq
    -0.83
     uſed
    -0.82
    )}_
    -0.82
     Shakspeare
    -0.82
    "]);
    
    -0.82
    POSITIVE LOGITS
    .
    0.67
    ochem
    0.65
     the
    0.60
    ValueStyle
    0.60
    legt
    0.57
     кв
    0.53
     others
    0.53
     et
    0.52
     a
    0.51
     pan
    0.51
    Act Density 0.083%

    No Known Activations