Explanations
references to specific dishes and food-related experiences
Negative Logits
×↵↵    -0.19
Aviv    -0.17
جر    -0.15
بواسطة    -0.15
Zur    -0.14
á¹    -0.14
">//    -0.14
Atl    -0.14
Sabha    -0.14
下载次数    -0.13
Positive Logits
Chinese    0.35
China    0.34
Asian    0.31
Asia    0.29
Chinese    0.28
China    0.28
chinese    0.27
Mandarin    0.25
Asian    0.24
Asians    0.23
Activations Density: 0.426%