INDEX
Explanations
terminology related to medical and scientific concepts
New Auto-Interp
Negative Logits
els
-0.16
Fight
-0.16
atra
-0.14
PLICIT
-0.14
_BOTH
-0.14
owitz
-0.13
лÑĥÑĩ
-0.13
apat
-0.13
Äįan
-0.13
elves
-0.13
POSITIVE LOGITS
terms
0.31
terminology
0.30
gloss
0.28
Gloss
0.28
terms
0.27
Terms
0.27
Terms
0.25
termin
0.25
termin
0.25
Termin
0.25
Activations Density 0.104%