INDEX
Explanations
terms related to medical conditions and their effects on health outcomes
preceding nouns/adjectives
both ancient and modern
Negative Logits
}'.  -0.86
'].  -0.77
].  -0.77
}}$.  -0.76
"].  -0.76
])).  -0.75
\}.  -0.75
\}$.  -0.74
})));  -0.74
}]);  -0.73
Positive Logits
كومونز  0.90
сылкі  0.76
ArgsConstructor  0.74
ConstraintMaker  0.67
OGND  0.67
WriteLiteral  0.67
iſt  0.66
Билгалдахарш  0.65
المعيارى  0.64
Kjelder  0.64
Activations Density 0.846%