INDEX
    Explanations

    phrases related to health, medical interventions, and treatment effects

    New Auto-Interp
    Negative Logits
    omo
    -0.15
    uze
    -0.15
    wolf
    -0.14
    ora
    -0.13
    boru
    -0.13
    ì¹ĺ
    -0.13
     chim
    -0.13
    fx
    -0.13
    _uploaded
    -0.12
    deo
    -0.12
    POSITIVE LOGITS
     future
    0.37
    future
    0.29
     Future
    0.28
    Future
    0.27
    .future
    0.23
    .Future
    0.23
    _future
    0.23
    UTURE
    0.22
     futuro
    0.21
     further
    0.21
    Act Density 0.155%

    No Known Activations