INDEX
    Explanations

    mentions of discussions or topics to be discussed

    instances of the word "discuss."

    Negative Logits
    "served"       -0.74
    "peria"        -0.74
    "eared"        -0.73
    " robbed"      -0.72
    "occupied"     -0.71
    "ifter"        -0.69
    "pes"          -0.69
    "installed"    -0.67
    "bott"         -0.66
    "gged"         -0.65
    Positive Logits
    "Discuss"        1.03
    " discussing"    0.96
    " discuss"       0.93
    " Discuss"       0.92
    " Topics"        0.90
    " discusses"     0.90
    " topics"        0.81
    "ļéĨĴ"           0.76
    " summarizes"    0.74
    " discussed"     0.74
    Activation Density: 0.014%

    No Known Activations