INDEX
    Explanations

    phrases that start with "While"

    occurrences of the word "While"

    New Auto-Interp
    Negative Logits
    atron
    -0.80
    rium
    -0.79
    aer
    -0.76
    omet
    -0.73
    iotic
    -0.72
    tnc
    -0.71
    enter
    -0.70
    onz
    -0.68
    ISE
    -0.67
    illet
    -0.67
    POSITIVE LOGITS
     acknowledging
    1.10
     researching
    0.95
     browsing
    0.94
     conced
    0.91
     discussing
    0.84
     respecting
    0.84
     commenting
    0.83
     agreeing
    0.83
     admitting
    0.78
     compiling
    0.77
    Act Density 0.038%

    No Known Activations