INDEX
    Explanations

    instances where concepts are being proposed or questioned, particularly those that may have implications or benefits

    New Auto-Interp
    Negative Logits
    atak
    -0.16
    isen
    -0.16
    uez
    -0.16
    alace
    -0.15
    isbury
    -0.15
    .Undef
    -0.14
    IRQ
    -0.14
    andler
    -0.14
    irs
    -0.14
    iefs
    -0.14
    POSITIVE LOGITS
     option
    0.23
     possibility
    0.21
     prospect
    0.21
     idea
    0.21
     notion
    0.20
     issue
    0.19
     task
    0.19
     proposition
    0.18
     added
    0.17
     question
    0.16
    Act Density 0.186%

    No Known Activations