INDEX
    Explanations

    interrogative punctuation and question marks

    New Auto-Interp
    Negative Logits
    emer
    -0.17
     seedu
    -0.16
    ingham
    -0.16
    /dr
    -0.16
    ifton
    -0.15
    ses
    -0.15
    side
    -0.15
    elsing
    -0.15
    licken
    -0.15
    keit
    -0.15
    POSITIVE LOGITS
    rious
    0.17
     Horton
    0.15
    ively
    0.15
    aneously
    0.14
    tings
    0.14
    groundColor
    0.14
    uating
    0.14
     طرÙĬÙĤ
    0.14
    ously
    0.14
    onne
    0.14
    Act Density 0.030%

    No Known Activations