INDEX
Explanations
cuisines from different countries
names of nationalities and cuisines
Negative Logits
etsk        -0.89
ividual     -0.88
ueller      -0.81
utonium     -0.79
alions      -0.78
odder       -0.76
ashington   -0.74
iscons      -0.74
vable       -0.74
ertodd      -0.74
Positive Logits
mystic      0.99
cuisine     0.96
monk        0.90
lantern     0.83
esque       0.82
delic       0.82
peasant     0.81
cooking     0.80
themed      0.79
istani      0.78
Activations Density 0.123%