INDEX
    Explanations

    the definite article "the" in various contexts

    New Auto-Interp
    Negative Logits
     someday
    -0.67
    itiz
    -0.66
     outweigh
    -0.65
    strap
    -0.64
    pointers
    -0.64
    apon
    -0.63
    emate
    -0.62
     whoever
    -0.62
    onto
    -0.62
    ional
    -0.62
    POSITIVE LOGITS
     meantime
    1.31
     midst
    1.16
     aftermath
    1.12
     absence
    1.03
     guise
    1.03
     context
    0.99
     simplest
    0.99
     same
    0.96
     nutshell
    0.94
     latter
    0.94
    Act Density 0.161%

    No Known Activations