INDEX
    Explanations

    negations or terms indicating the absence of something

    New Auto-Interp
    Negative Logits
    allery
    -0.17
    .Undef
    -0.16
    izzo
    -0.15
    ilden
    -0.15
    ucose
    -0.15
    ças
    -0.15
     prim
    -0.14
    DataExchange
    -0.14
    /shared
    -0.14
    .AutoScale
    -0.13
    POSITIVE LOGITS
    REA
    0.17
    odel
    0.15
     Kramer
    0.15
    icer
    0.14
    iams
    0.14
    ively
    0.14
    aji
    0.14
     Bair
    0.14
    conti
    0.14
    afa
    0.14
    Act Density 0.117%

    No Known Activations