    Explanations

    questions beginning with "How" or "What."

    Negative Logits
    Token               Logit
    "ilon"              -0.17
    "eger"              -0.16
    " aggregate"        -0.15
    "inis"              -0.15
    "olland"            -0.14
    "abant"             -0.14
    "igor"              -0.14
    " level"            -0.14
    " exponential"      -0.14
    "aggregate"         -0.14

    Positive Logits
    Token               Logit
    "utenberg"           0.16
    "alli"               0.16
    ".jquery"            0.15
    "elmet"              0.14
    ".Slf"               0.14
    " ail"               0.14
    "ãĥ£"                0.14
    "$MESS"              0.14
    ".SIZE"              0.14
    "_fence"             0.13

    Act Density 0.017%

    No Known Activations