INDEX
Explanations
phrases related to health and medical conditions, especially those related to diet and physical attributes
terms related to health, diet, and nutritional information
Negative Logits
atz             -0.63
terday          -0.61
netflix         -0.51
cffffcc         -0.50
estern          -0.49
ovember         -0.48
DragonMagazine  -0.48
apego           -0.47
Watergate       -0.47
urches          -0.47
Positive Logits
%.   0.72
'.   0.69
.    0.67
.(   0.65
.[   0.65
>.   0.64
$.   0.63
_.   0.62
.'   0.61
().  0.60
Activations Density: 1.807%