INDEX
Explanations
references to sweetness or sweet-related terms
references to "sweet" and similar terms
Negative Logits
Token         Logit
Downloadha    -1.08
kson          -0.76
govtrack      -0.76
Administ      -0.67
ividual       -0.64
ineligible    -0.62
prohibited    -0.62
fielded       -0.61
agall         -0.61
senal         -0.61
Positive Logits
Token         Logit
heart         1.57
eners         1.52
ener          1.43
ened          1.42
ening         1.31
est           1.19
bread         1.17
potato        1.13
potatoes      1.02
water         0.96
Activations Density: 0.029%