INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    artifacts
    -0.79
    wcs
    -0.78
    minist
    -0.76
    mining
    -0.73
    aneously
    -0.72
    ombo
    -0.71
    different
    -0.70
    ital
    -0.70
    ibaba
    -0.70
    trop
    -0.70
    POSITIVE LOGITS
     Sons
    1.01
     Associates
    0.95
     Sharon
    0.90
     Kyl
    0.88
     Jerry
    0.87
     Michelle
    0.87
     Mary
    0.87
     Morty
    0.87
     Tammy
    0.86
     Tanner
    0.86
    Act Density 0.048%

    No Known Activations