INDEX
Explanations
words related to cooking and recipes
references to cookware or cooking utensils
New Auto-Interp
Negative Logits
åŃ
-0.78
ij士
-0.78
IGH
-0.77
@#&
-0.74
GEAR
-0.73
Seym
-0.70
çĭ
-0.67
ãĤī
-0.67
è¦
-0.65
marked
-0.65
POSITIVE LOGITS
ning
1.09
zees
0.95
eways
0.90
xual
0.85
olin
0.83
ned
0.80
psons
0.79
ahon
0.77
oche
0.76
pan
0.76
Activations Density 0.018%