INDEX
    Explanations

    text related to Twilight series

    occurrences of the token "tw" in various contexts

    New Auto-Interp
    Negative Logits
     PRESS
    -0.79
    HAEL
    -0.73
    ulative
    -0.72
    senal
    -0.67
    mble
    -0.66
    ãĤ¹ãĥĪ
    -0.65
    MIC
    -0.65
    ãĤŃ
    -0.64
    Journal
    -0.63
    000000
    -0.62
    POSITIVE LOGITS
    enty
    1.08
    tw
    1.01
    elve
    0.96
    orks
    0.92
    ymes
    0.82
    eteen
    0.81
    itness
    0.78
    ixt
    0.78
    actory
    0.77
     Tw
    0.75
    Act Density 0.006%

    No Known Activations