INDEX
    Explanations

    the word "are" in various contexts

    the word "are," indicating emphasis on being or existence

    New Auto-Interp
    Negative Logits
    omez
    -0.69
    ingen
    -0.67
    uration
    -0.65
    imating
    -0.61
    oting
    -0.59
    allery
    -0.59
    ured
    -0.59
    ertodd
    -0.59
    imation
    -0.58
    OOL
    -0.58
    POSITIVE LOGITS
    nce
    1.03
    nces
    1.01
    tsky
    1.00
    tto
    0.97
    nda
    0.88
    tta
    0.87
    nt
    0.86
    nd
    0.83
    zza
    0.82
    lli
    0.81
    Act Density 0.019%

    No Known Activations