Explanations
scientific terminology related to health and its effects on the human body
Negative Logits
Germ          -0.15
idden         -0.14
dwell         -0.14
amu           -0.14
881           -0.14
riminator     -0.13
Gerry         -0.13
338           -0.13
acia          -0.13
arde          -0.13
Positive Logits
Violation      0.16
appoint        0.15
GIT            0.15
زÙĨ            0.15
æĤ             0.15
emiz           0.15
ffe            0.15
normalization  0.14
تÙĤÙĪ          0.14
ÏĦεÏģο         0.14
Activations Density 0.055%