INDEX
    Explanations

    references to the word "Wool"

    New Auto-Interp
    Negative Logits
     Aval
    -0.70
     Explain
    -0.69
     Hiroshima
    -0.68
     citation
    -0.67
     Oro
    -0.65
    CENT
    -0.65
     Cuomo
    -0.63
     INF
    -0.63
     Referred
    -0.62
     Activate
    -0.62
    POSITIVE LOGITS
    wich
    1.16
    sey
    1.08
    worth
    1.07
    wyn
    1.03
    awei
    1.02
    nesday
    0.97
    enn
    0.96
    worms
    0.96
    igan
    0.95
    isine
    0.94
    Act Density 0.004%

    No Known Activations