INDEX
    Explanations

    terms related to medical conditions and their effects on health outcomes

    preceding nouns/adjectives

    both ancient and modern

    New Auto-Interp
    Negative Logits
    }'.
    -0.86
    '].
    -0.77
    ].
    -0.77
    }}$.
    -0.76
    "].
    -0.76
    ])).
    -0.75
    \}.
    -0.75
    \}$.
    -0.74
    })));
    -0.74
    }]);
    -0.73
    POSITIVE LOGITS
     كومونز
    0.90
    сылкі
    0.76
    ArgsConstructor
    0.74
    ConstraintMaker
    0.67
    OGND
    0.67
    WriteLiteral
    0.67
     iſt
    0.66
    Билгалдахарш
    0.65
     المعيارى
    0.64
    Kjelder
    0.64
    Act Density 0.846%

    No Known Activations