INDEX
Explanations
terms related to iron or iron-related concepts
references to iron in various contexts
New Auto-Interp
Negative Logits
earch
-0.76
renheit
-0.74
soType
-0.74
adows
-0.74
anchester
-0.70
Ń·
-0.70
lus
-0.69
Retrieved
-0.69
Garr
-0.69
ORTS
-0.69
POSITIVE LOGITS
clad
1.33
ore
1.07
iron
1.00
oxide
0.99
skillet
0.93
chest
0.91
fist
0.89
rod
0.88
works
0.87
worm
0.85
Activations Density 0.008%