INDEX
    Explanations

    occurrences of the word "do" and its variations

    New Auto-Interp
    Negative Logits
     houſe
    -1.01
     ſche
    -0.98
     itſelf
    -0.86
     stiefel
    -0.86
     ſever
    -0.86
     Efq
    -0.85
     Jefus
    -0.85
     <<<<<<<<<<<<<<
    -0.85
     themſelves
    -0.84
     pleaſure
    -0.84
    POSITIVE LOGITS
     do
    1.47
     done
    1.25
     does
    1.24
     Do
    1.22
     doing
    1.19
    Do
    1.17
     Doing
    1.15
     did
    1.13
    Doing
    1.11
     DOING
    1.10
    Act Density 0.157%

    No Known Activations