INDEX
    Explanations

    words related to historical origins or original intentions

    New Auto-Interp
    Negative Logits
    	Copyright
    -0.07
     newObj
    -0.06
     new
    -0.06
    :checked
    -0.06
    uli
    -0.06
    loud
    -0.06
     heritage
    -0.06
     finally
    -0.06
    elize
    -0.06
     recent
    -0.06
    POSITIVE LOGITS
     originally
    0.08
    [section
    0.08
     intended
    0.08
     initially
    0.08
     yalnızca
    0.07
     called
    0.07
    arker
    0.07
    called
    0.07
    /original
    0.07
     only
    0.07
    Act Density 0.025%

    No Known Activations