INDEX
    Explanations

    instances of the word "did."

    "I'd" or "would" contractions

    New Auto-Interp
    Negative Logits
     not
    -1.00
    not
    -0.75
     Not
    -0.48
     NOT
    -0.47
     tidak
    -0.45
    Not
    -0.44
     nicht
    -0.44
     among
    -0.39
    among
    -0.39
    NOT
    -0.38
    POSITIVE LOGITS
    enumii
    0.57
    iddhar
    0.56
     estekak
    0.56
    ArgsConstructor
    0.55
     typelib
    0.54
     चीज़ों
    0.51
     الحره
    0.51
    нгред
    0.50
    SequentialGroup
    0.50
    Ivoire
    0.49
    Act Density 0.070%

    No Known Activations