INDEX
    Explanations

    instances of the article "a" and variations of it

    Negative Logits
    ONS       -0.16
    .break    -0.15
    uft       -0.15
    uco       -0.15
    ons       -0.14
    ond       -0.14
    agli      -0.14
    rup       -0.14
    .Xaml     -0.14
    pliant    -0.14
    Positive Logits
    akan      0.18
    ighton    0.16
    θερ       0.15
    encil     0.14
    idi       0.14
    ermen     0.14
    oice      0.14
     ago      0.14
    ije       0.14
    orer      0.14
    Activation Density: 0.078%

    No Known Activations