INDEX
Explanations
words related to sweetness or sweet characteristics
New Auto-Interp
Negative Logits
Downloadha
-1.16
Administ
-0.79
kson
-0.78
agher
-0.70
Occupations
-0.68
agall
-0.66
Uz
-0.65
senal
-0.65
Bakr
-0.64
NUM
-0.64
POSITIVE LOGITS
heart
1.45
eners
1.42
ened
1.34
ener
1.25
ening
1.17
bread
1.11
potato
1.11
est
1.08
sweet
0.96
potatoes
0.96
Activations Density 0.017%