INDEX
    Explanations

    concepts related to life, health, and well-being

    New Auto-Interp
    Negative Logits
    -induced
    -0.19
    olls
    -0.16
    NotificationCenter
    -0.16
    657
    -0.15
    çĸ²
    -0.15
     contrad
    -0.14
    ampa
    -0.14
     indirect
    -0.14
    azu
    -0.14
     severe
    -0.14
    POSITIVE LOGITS
    -saving
    0.23
    -threatening
    0.22
     enrich
    0.22
    -confirm
    0.21
    -transform
    0.21
    -limit
    0.21
     stabil
    0.20
    -changing
    0.20
     MODIFY
    0.19
    -def
    0.19
    Act Density 0.266%

    No Known Activations