INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ivari
    -0.76
    ivating
    -0.74
    CVE
    -0.74
    inen
    -0.73
    chenko
    -0.71
    oren
    -0.71
    nings
    -0.71
    IFA
    -0.70
    nesota
    -0.69
    alez
    -0.69
    POSITIVE LOGITS
     cardboard
    1.12
     boxes
    1.12
     box
    1.00
     wrapper
    0.86
     tubes
    0.86
     tube
    0.84
     sleeves
    0.83
     containers
    0.82
     packaging
    0.81
     crates
    0.80
    Act Density 0.003%

    No Known Activations