INDEX
Explanations
the word 'Iron' followed by another word
mentions of "Iron" in various contexts
New Auto-Interp
Negative Logits
uated
-0.81
arre
-0.75
enance
-0.74
soType
-0.72
CLS
-0.66
ortion
-0.66
itia
-0.66
ired
-0.65
igate
-0.64
uates
-0.64
POSITIVE LOGITS
clad
1.14
ore
0.91
axe
0.86
marrow
0.84
Iron
0.83
Age
0.81
works
0.80
forge
0.79
mong
0.78
claw
0.76
Activations Density 0.007%