INDEX
    Explanations

    mentions of the word "illicit" or variations thereof

    words related to affiliation and belonging

    New Auto-Interp
    Negative Logits
     Wonderland
    -0.65
     shining
    -0.62
     place
    -0.61
     royalties
    -0.60
     clipping
    -0.60
     dex
    -0.60
     Colorado
    -0.59
     finalized
    -0.59
     threshold
    -0.58
     Dread
    -0.58
    POSITIVE LOGITS
    ili
    4.53
    iliation
    2.06
    ilia
    1.96
    iliated
    1.78
    iliate
    1.73
    ilian
    1.71
    ilic
    1.52
    ile
    1.51
    iliary
    1.51
    illi
    1.49
    Act Density 0.006%

    No Known Activations