INDEX
    Explanations

    unconventional symbols or characters, as well as various conjunctions and words that create conditions or cause effects

    New Auto-Interp
    Negative Logits
     ketogenic
    -0.15
    2
    -0.14
    4
    -0.14
    3
    -0.14
    1
    -0.14
    0
    -0.13
    8
    -0.13
    PI
    -0.12
     womens
    -0.12
    7
    -0.12
    POSITIVE LOGITS
     whoever
    0.38
     somebody
    0.31
     Whoever
    0.30
     someone
    0.29
     whichever
    0.28
     whatever
    0.27
     said
    0.27
     THEY
    0.25
     SOM
    0.23
    Whatever
    0.23
    Act Density 0.482%

    No Known Activations