INDEX
Explanations
sugar and diabetes questions
New Auto-Interp
Negative Logits
emplea
0.36
anschauen
0.35
gustó
0.34
틀
0.34
蔑
0.34
verdienen
0.33
짰
0.33
евич
0.33
линд
0.33
某一
0.32
POSITIVE LOGITS
sugar
0.67
foods
0.66
blood
0.65
sugar
0.63
Foods
0.61
Sugar
0.60
diabetics
0.60
insulin
0.59
fasting
0.59
lowering
0.58
Activations Density 0.001%