INDEX
    Explanations

    adjectives ending in "ive"

    Negative Logits
    Token          Logit
    "ième"         -1.05
    "adays"        -1.02
    "er"           -1.01
    "inghouse"     -0.97
    "ing"          -0.95
    "󠁿"            -0.95
    "stood"        -0.94
    "ergies"       -0.94
    "تقاوى"        -0.93
    " NSCoder"     -0.92
    Positive Logits
    Token          Logit
    " work"        0.54
    " approach"    0.53
    " ones"        0.52
    " nature"      0.51
    " human"       0.50
    "</i>"         0.49
    " issue"       0.49
    " plan"        0.47
    " condition"   0.47
    " way"         0.47
    Activation Density: 0.070%

    No Known Activations