INDEX
Explanations
Words and phrases related to health and wellness, particularly medical issues and their impacts
Negative Logits
ÙĥÙĬÙĬÙģ  -0.17
lland  -0.17
ivy  -0.16
ÙĦÙĥتر  -0.15
uyết  -0.15
antom  -0.15
antas  -0.15
ÙĪÛĮÙĨت  -0.14
ulas  -0.14
κε  -0.14
Positive Logits
var  0.15
ż  0.15
ko  0.15
æ¶ī  0.14
âĤ¬  0.14
eline  0.14
actual  0.13
_RE  0.13
ob  0.13
relation  0.13
Activations Density 0.360%