INDEX
    Explanations

    questions and prompts for information

    questions and interrogative sentences

    New Auto-Interp
    Negative Logits
    igans
    -0.83
    atten
    -0.71
    oling
    -0.68
    oche
    -0.67
    pointers
    -0.67
    iple
    -0.66
    rawdownloadcloneembedreportprint
    -0.66
    ryu
    -0.64
    zin
    -0.64
    tin
    -0.63
    POSITIVE LOGITS
     Explain
    0.85
     What
    0.76
     Close
    0.75
     Why
    0.75
     Previous
    0.75
     Would
    0.73
     How
    0.73
     Does
    0.73
     Who
    0.71
     Disability
    0.70
    Act Density 0.047%

    No Known Activations