INDEX
    Explanations

    references to "Old" in various contexts

    New Auto-Interp
    Negative Logits
    ushi
    -0.17
    .cx
    -0.17
    atrix
    -0.14
    itious
    -0.14
    091
    -0.14
    Dialogue
    -0.14
    plorer
    -0.14
    ohl
    -0.14
    ote
    -0.14
    alyze
    -0.14
    POSITIVE LOGITS
    enburg
    0.28
    ÅĻich
    0.26
    endor
    0.24
    -fashioned
    0.22
     fashioned
    0.21
    /New
    0.21
     testament
    0.19
     Testament
    0.19
    ENDOR
    0.19
    -new
    0.18
    Act Density 0.028%

    No Known Activations