INDEX
    Explanations

    the word "only" in various contexts, emphasizing exclusivity or singularity

    New Auto-Interp
    Negative Logits
    elt
    -0.15
    _iff
    -0.13
    cken
    -0.13
    ennen
    -0.13
    кова
    -0.13
     enough
    -0.13
    .import
    -0.13
    atica
    -0.13
     _$
    -0.13
    pga
    -0.12
    POSITIVE LOGITS
     thing
    0.35
     remaining
    0.28
     Thing
    0.24
    remaining
    0.24
    thing
    0.24
     way
    0.23
    Thing
    0.23
    Remaining
    0.22
     ones
    0.21
    (thing
    0.21
    Act Density 0.043%

    No Known Activations