INDEX
Explanations
references to medical professionals, particularly doctors and physicians
New Auto-Interp
Negative Logits
gers
-0.17
ties
-0.17
ters
-0.16
ted
-0.16
ithe
-0.16
lish
-0.15
enser
-0.15
ạng
-0.15
ulas
-0.15
cone
-0.15
POSITIVE LOGITS
/engine
0.19
/ph
0.17
ial
0.16
iginal
0.15
imd
0.14
unction
0.14
ëŀľëĵľ
0.14
à¸į
0.13
ship
0.13
åģĩ
0.13
Activations Density 0.023%