INDEX
Explanations
words related to stoves and cooking appliances
references to cooking appliances and utensils
New Auto-Interp
Negative Logits
Byrne
-0.83
aird
-0.70
rity
-0.68
oral
-0.67
ados
-0.67
orns
-0.66
oria
-0.66
eme
-0.66
ually
-0.65
vectors
-0.65
POSITIVE LOGITS
stove
1.27
burner
1.18
cooker
1.14
pipe
0.99
sonian
0.93
kettle
0.88
oven
0.86
cook
0.86
grill
0.86
washer
0.84
Activations Density 0.023%