INDEX
Explanations
words related to dietary restrictions and medical conditions, particularly concerning gluten and dairy
negative health-related terms or conditions
New Auto-Interp
Negative Logits
heights
-0.87
succession
-0.80
patience
-0.79
clocks
-0.78
redund
-0.78
square
-0.75
step
-0.73
execut
-0.71
salaries
-0.70
twenties
-0.70
POSITIVE LOGITS
related
1.76
induced
1.74
containing
1.67
resistant
1.66
based
1.66
derived
1.63
free
1.63
associated
1.62
laden
1.60
specific
1.57
Activations Density 0.065%