INDEX
    Explanations

    phrases related to improvement and critique

    preceding "improvement" or "improve"

    New Auto-Interp
    Negative Logits
    しまう
    -0.42
    fortawesome
    -0.41
     lệ
    -0.40
    -0.40
    しまいます
    -0.37
    OSE
    -0.36
    liert
    -0.36
    ю
    -0.36
    -0.35
     setuptools
    -0.35
    POSITIVE LOGITS
     improvement
    2.80
     improvements
    2.63
     Improvement
    2.51
    improvement
    2.50
    improve
    2.41
    Improvement
    2.39
     Improvements
    2.38
     IMPROVEMENT
    2.33
     improve
    2.33
    Improvements
    2.32
    Act Density 0.420%

    No Known Activations