Explanations
phrases discussing medical conditions and procedures
Negative Logits
antino       -0.17
olin         -0.15
bracht       -0.13
chl          -0.13
éĥ           -0.13
ợ            -0.13
Woodward     -0.13
htar         -0.12
utherford    -0.12
pector       -0.12
Positive Logits
full         0.87
complete     0.86
fully        0.74
completo     0.72
completely   0.70
FULL         0.70
full         0.68
完全         0.68
complete     0.67
COMPLETE     0.67
Activations Density 0.495%