INDEX
    Explanations

    mentions of a specific word 'Ze' followed by a number

    occurrences of the word "ze" in various contexts

    Negative Logits
    Token             Logit
    " behavi"         -0.89
    " ancest"         -0.79
    "Interstitial"    -0.75
    "ials"            -0.74
    " reconc"         -0.73
    "IAL"             -0.70
    " bullish"        -0.66
    "ancial"          -0.66
    " recre"          -0.65
    " reluct"         -0.64
    Positive Logits
    Token             Logit
    "ze"              1.26
    "lda"             1.20
    "ppelin"          1.02
    "ÅĤ"              1.00
    "zes"             0.98
    "ppe"             0.96
    "zy"              0.93
    "ggle"            0.89
    "itsch"           0.88
    "ppy"             0.86
    Act Density 0.006%
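
    The page does not define how the activation density is computed. A minimal sketch, assuming it is the share of sampled tokens on which the feature's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and only illustrate the arithmetic:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means (number of active tokens) / (total tokens).
    The source page does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 6 active tokens out of 100,000.
acts = [0.0] * 99_994 + [1.5] * 6
density = activation_density(acts)
print(f"{density:.3%}")  # formatted as a percentage with three decimals
```

    Under this reading, a density of 0.006% corresponds to roughly 6 active tokens per 100,000 sampled.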

    No Known Activations