INDEX
    Explanations

    words related to physical and mental wellbeing, sometimes in the context of social connection

    New Auto-Interp
    Negative Logits
    withstanding
    -1.42
    theless
    -1.34
    ization
    -1.31
    queous
    -1.21
    neath
    -1.17
    time
    -1.17
    imed
    -1.16
    ergies
    -1.16
    poptosis
    -1.16
    fraid
    -1.15
    POSITIVE LOGITS
    ponym
    0.57
    multirow
    0.55
    stø
    0.53
     disambiguazione
    0.52
    umpang
    0.50
     giuri
    0.50
    året
    0.49
    hofer
    0.49
    skab
    0.48
     stør
    0.48
    Act Density 1.460%

    No Known Activations