INDEX
Explanations
words related to healthcare professions and roles
Negative Logits
FAVOR         -0.76
FAVOR         -0.76
coloring      -0.72
ardor         -0.71
favor         -0.70
Afterward     -0.69
ighborhood    -0.69
flavors       -0.69
Neighbors     -0.68
watercolor    -0.68
Positive Logits
]');          0.78
('');         0.67
});*/         0.63
})*/          0.63
>';           0.62
/>);          0.62
})));         0.62
]';           0.60
licence       0.59
}*/           0.59
Activations Density 1.517%