    Negative Logits
    Token           Logit
    "improved"      -0.60
    " améli"        -0.59
    " Improving"    -0.58
    " Improve"      -0.56
    " réguli"       -0.56
    "améli"         -0.56
    "improve"       -0.55
    " amélior"      -0.54
    "jago"          -0.54
    "Improved"      -0.54
    Positive Logits
    Token             Logit
    " outcomes"       0.68
    "IsContent"       0.60
    " outcome"        0.59
    " neck"           0.52
    " pitch"          0.50
    " Outcomes"       0.49
    "商品説明"        0.49
    " satisfaction"   0.48
    " survi"          0.47
    " mouth"          0.47
    Activation Density: 0.075%

    No Known Activations