INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     spirituality
    -0.07
     Stand
    -0.07
     count
    -0.07
    sexual
    -0.07
     Fitzgerald
    -0.07
    .Default
    -0.06
     preval
    -0.06
     phantom
    -0.06
    ження
    -0.06
    -0.06
    POSITIVE LOGITS
     jewel
    0.17
     jewels
    0.13
     Jewel
    0.13
    uilt
    0.08
    pers
    0.07
     oasis
    0.06
     cruel
    0.06
     Spl
    0.06
    reff
    0.06
    RESULT
    0.06
    Act Density 0.009%

    No Known Activations