INDEX
    Explanations

    specific terms related to various societal topics and concepts

    terms related to scientific evidence and research

    New Auto-Interp
    Negative Logits
    tnc
    -0.55
     fame
    -0.52
    ãģ¾
    -0.49
    âĵĺ
    -0.46
     hers
    -0.44
    idden
    -0.43
    burning
    -0.42
    pron
    -0.42
     eternity
    -0.42
     respectively
    -0.41
    POSITIVE LOGITS
     Pwr
    0.50
    nces
    0.49
     citiz
    0.49
    bender
    0.49
    terness
    0.47
    etheless
    0.46
     Webs
    0.44
    anyahu
    0.43
     glim
    0.42
     QUEST
    0.41
    Act Density 4.966%

    No Known Activations