INDEX
Explanations
references to food storage and preparation
New Auto-Interp
Negative Logits
utor
-0.16
rake
-0.15
iram
-0.14
623
-0.14
_WE
-0.14
ackage
-0.14
uro
-0.14
пал
-0.14
xit
-0.13
Fireplace
-0.13
POSITIVE LOGITS
fridge
0.59
refrigerator
0.56
refriger
0.52
Refriger
0.48
freezer
0.47
ÑħолодилÑĮ
0.41
Frid
0.38
fr
0.37
refr
0.36
Fr
0.32
Activations Density 0.126%