INDEX
    Explanations

    religious terms or references

    references to relatives or family relationships

    Negative Logits
     wide          -0.71
     Lead          -0.65
     wider         -0.60
     thinly        -0.60
     Slate         -0.60
     slate         -0.59
     buck          -0.59
     Marketplace   -0.57
     Wilde         -0.57
     Swan          -0.56
    Positive Logits
    igion       1.70
    iability    1.50
    iable       1.47
    atively     1.42
    igious      1.39
    atives      1.36
    iever       1.36
    iance       1.35
    ativity     1.33
    aunch       1.33
    Act Density 0.029%

    No Known Activations