INDEX
    Negative Logits
    Token          Logit
    " spont"       -0.72
    " drawer"      -0.66
    " gag"         -0.65
    " vanity"      -0.62
    " rigs"        -0.61
    " appe"        -0.60
    "wagen"        -0.59
    " peg"         -0.59
    " everyday"    -0.59
    " crude"       -0.58
    Positive Logits
    Token          Logit
    "actionDate"   0.95
    " ]["          0.84
    "ĸļ"           0.77
    "taboola"      0.76
    "à¼"           0.76
    "align"        0.76
    "::::::::"     0.73
    "ojure"        0.72
    "mosp"         0.70
    "lement"       0.70
    Activation Density: 0.782%

    No Known Activations