INDEX
    Explanations

    phrases indicating choices or alternatives

    the word "or" used in various contexts

    New Auto-Interp
    Negative Logits
    ernels
    -0.95
    lees
    -0.91
    flows
    -0.81
    doms
    -0.80
     Attacks
    -0.78
     Accounts
    -0.77
    kins
    -0.76
    ouses
    -0.76
    Rs
    -0.76
    akes
    -0.75
    POSITIVE LOGITS
     bracelet
    0.94
     charger
    0.91
     pamphlet
    0.88
     a
    0.86
    chard
    0.85
     two
    0.84
     necklace
    0.84
     thinker
    0.84
     proposition
    0.84
     piece
    0.84
    Act Density 0.185%

    No Known Activations