INDEX
    Explanations

    mentions of medical conditions or symptoms

    terms related to the concept of "out" or "being out," especially in various contexts

    New Auto-Interp
    Negative Logits
    ha
    -0.85
    Gra
    -0.82
    leigh
    -0.81
    gra
    -0.81
    hei
    -0.75
    angle
    -0.73
     helic
    -0.72
    lia
    -0.72
    hene
    -0.70
    ŃĶ
    -0.70
    POSITIVE LOGITS
     Out
    1.99
    Out
    1.89
     OUT
    1.83
    out
    1.80
     out
    1.72
    OUT
    1.71
     outs
    1.51
    outs
    1.51
     Outs
    1.41
    outed
    1.15
    Act Density 0.110%

    No Known Activations