INDEX
Explanations
words associated with food adulteration or contamination
New Auto-Interp
Head Attr Weights
0:0.03
1:0.03
2:0.06
3:0.04
4:0.05
5:0.04
6:0.09
7:0.19
8:0.05
9:0.06
10:0.24
11:0.06
Negative Logits
mentors
-1.85
Apostles
-1.71
Lessons
-1.71
applauded
-1.69
lightning
-1.67
pillars
-1.66
Transform
-1.64
magnification
-1.63
discipl
-1.61
laughter
-1.58
POSITIVE LOGITS
adul
2.07
upid
1.83
icides
1.83
ipop
1.81
achus
1.80
puter
1.77
plet
1.77
offending
1.75
avorite
1.74
ixture
1.73
Activations Density 0.000%