INDEX
Explanations
references to freshness in food or experiences
Negative Logits
Token        Logit
fresh        -0.24
Fresh        -0.23
fresh        -0.23
Fresh        -0.22
fres         -0.20
freshwater   -0.20
freshman     -0.19
freshness    -0.19
freshly      -0.18
freshmen     -0.17
Positive Logits
Token        Logit
ening        0.35
ened         0.34
eners        0.30
-faced       0.29
ener         0.28
-air         0.24
en           0.23
ens          0.23
/new         0.23
water        0.22
Activations Density 0.022%
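
The density figure above is not defined on this page. A common reading, assumed here, is the fraction of token positions on which the feature's activation is nonzero. The sketch below shows that arithmetic under that assumption; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only so the result matches 0.022%.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means (active positions) / (all positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 22 of which fire.
sample = [0.0] * 99_978 + [1.0] * 22
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.022%
```

Under this reading, a density of 0.022% corresponds to roughly 22 active positions per 100,000 tokens, which would make this a sparse feature.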