INDEX
    Explanations

    "Could you" followed by a request

    New Auto-Interp
    Negative Logits
     estamos
    0.38
     estábamos
    0.36
     இனி
    0.36
     estaremos
    0.35
     Estamos
    0.34
    irish
    0.34
     klingt
    0.34
     currentPage
    0.34
    0.34
    多かった
    0.33
    POSITIVE LOGITS
     please
    0.81
    给我
    0.80
     explain
    0.77
     pls
    0.74
     give
    0.72
     help
    0.70
    給我
    0.70
     provide
    0.70
    帮忙
    0.69
     пожалуйста
    0.68
    Act Density 0.017%

    No Known Activations