INDEX
    Explanations

    mentions of the word "old" in various contexts

    the word "gold" in various contexts

    New Auto-Interp
    Negative Logits
    Downloadha
    -0.73
    BILITY
    -0.72
    senal
    -0.71
     FANTASY
    -0.67
    OPLE
    -0.66
    involved
    -0.66
    ESA
    -0.65
     Pwr
    -0.64
     dism
    -0.61
    FUL
    -0.61
    POSITIVE LOGITS
    orf
    1.18
    ynam
    1.10
    ouble
    1.05
    roid
    1.04
    rums
    1.03
    ership
    0.97
    ritch
    0.95
    irect
    0.94
    ynamic
    0.92
    iesel
    0.92
    Act Density 0.019%

    No Known Activations