INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Becker
    -0.07
     mark
    -0.07
     stair
    -0.07
     yellow
    -0.07
     Balt
    -0.06
    opleft
    -0.06
     mak
    -0.06
     Roma
    -0.06
     Layout
    -0.06
     A
    -0.06
    POSITIVE LOGITS
     infusion
    0.12
     infused
    0.08
     inf
    0.08
    umatic
    0.07
    isu
    0.07
    IFS
    0.07
     admir
    0.07
     Inf
    0.07
    fusion
    0.07
    buie
    0.07
    Act Density 0.005%

    No Known Activations