INDEX
    Explanations

    phrases that involve an emphasis on "the" as a definite article and its association with nouns or descriptions

    New Auto-Interp
    Negative Logits
    esian
    -0.15
    folio
    -0.15
    ове
    -0.14
    348
    -0.14
    æľºä¼ļ
    -0.14
    iker
    -0.14
    azzo
    -0.14
    .Router
    -0.13
    eness
    -0.13
     Nisan
    -0.13
    POSITIVE LOGITS
     ones
    0.25
     butt
    0.23
     exception
    0.21
     luck
    0.21
     target
    0.20
     toast
    0.20
     focus
    0.20
     cause
    0.19
    ones
    0.19
     belle
    0.18
    Act Density 0.110%

    No Known Activations