INDEX
    Explanations
    NEGATIVE LOGITS
     nowrap       -0.06
     Welfare      -0.06
     Cap          -0.06
    .coll         -0.06
    087           -0.06
    _nl           -0.06
    setProperty   -0.06
     Pollution    -0.06
    -consuming    -0.06
    017           -0.06
    POSITIVE LOGITS
     team          0.08
     classes       0.07
     gadgets       0.07
                   0.07
    .urls          0.07
     teammates     0.07
    tribute        0.07
    VES            0.07
     Baseball      0.06
    =tf            0.06
    Activation Density: 0.003%

    No Known Activations