INDEX
    Explanations

    response beginning with "Okay, let's"

    New Auto-Interp
    Negative Logits
     faiblement
    0.72
     మాత్ర
    0.67
     secondes
    0.64
     hollow
    0.62
     cylindrical
    0.61
    如下
    0.61
     weakly
    0.60
     hereinafter
    0.58
     powerless
    0.58
     raczej
    0.58
    POSITIVE LOGITS
    So
    1.07
     so
    1.06
     So
    1.03
    Good
    0.90
    so
    0.86
     Good
    0.81
    Excellent
    0.79
     good
    0.78
    0.77
     excellent
    0.74
    Act Density 0.344%

    No Known Activations