INDEX
    Explanations

    words related to destruction or removal

    words associated with deprivation or lack

    New Auto-Interp
    Negative Logits
     Dragonbound
    -0.83
    nings
    -0.80
    ãĤ¤ãĥĪ
    -0.80
    FORE
    -0.79
     Hole
    -0.77
     Flavoring
    -0.75
    ãĥīãĥ©ãĤ´ãĥ³
    -0.74
     Millennium
    -0.73
    WORK
    -0.72
     Sandwich
    -0.71
    POSITIVE LOGITS
    utations
    1.12
    raved
    1.11
    utation
    1.05
    reci
    1.03
    ository
    1.01
    rec
    1.01
    ravity
    0.96
    orters
    0.96
    ugal
    0.94
    onent
    0.90
    Act Density 0.010%

    No Known Activations