INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     astr
    -0.07
     ASE
    -0.07
     anger
    -0.07
     CREATE
    -0.07
    (Call
    -0.07
     Luke
    -0.07
    Pause
    -0.07
     dah
    -0.07
    -0.06
     Arr
    -0.06
    POSITIVE LOGITS
     specimens
    0.12
     specimen
    0.11
     jeux
    0.08
    0.08
     './
    0.07
    imens
    0.07
    .repaint
    0.07
     bullpen
    0.07
    0.07
     Fixture
    0.07
    Act Density 0.003%

    No Known Activations