INDEX
    Explanations

    repetitions of the word "its" and variations in capitalization

    New Auto-Interp
    Negative Logits
    COMPARE
    -0.17
    ALER
    -0.17
    istrator
    -0.16
    CDATA
    -0.15
    YTE
    -0.15
    infeld
    -0.15
    trag
    -0.15
    ÑĥÑĢг
    -0.15
    جاد
    -0.15
    icerca
    -0.15
    POSITIVE LOGITS
     lung
    0.16
    ai
    0.15
    ra
    0.15
    m
    0.15
    ch
    0.15
    g
    0.15
    x
    0.14
    d
    0.14
    creen
    0.14
     Lung
    0.14
    Act Density 0.031%

    No Known Activations