INDEX
    Explanations

    the phrase "the only thing" followed by a noun or gerund

    repeated references to "the only thing" or similar phrases that emphasize singular importance or focus

    New Auto-Interp
    Negative Logits
    ãĤ´ãĥ³
    -0.80
    baugh
    -0.79
    blast
    -0.74
    oufl
    -0.70
     fixme
    -0.69
    har
    -0.66
    println
    -0.66
    nec
    -0.66
    holm
    -0.66
    xtap
    -0.65
    POSITIVE LOGITS
     that
    0.98
     happening
    0.87
     we
    0.84
     separating
    0.79
     you
    0.79
     THAT
    0.79
     bothering
    0.75
     they
    0.74
     preventing
    0.74
     I
    0.73
    Act Density 0.061%

    No Known Activations