INDEX
    Explanations

    the word 'Iron' followed by another word

    mentions of "Iron" in various contexts

    New Auto-Interp
    Negative Logits
    uated
    -0.81
    arre
    -0.75
    enance
    -0.74
    soType
    -0.72
     CLS
    -0.66
    ortion
    -0.66
    itia
    -0.66
    ired
    -0.65
    igate
    -0.64
    uates
    -0.64
    POSITIVE LOGITS
    clad
    1.14
     ore
    0.91
     axe
    0.86
     marrow
    0.84
     Iron
    0.83
     Age
    0.81
    works
    0.80
    forge
    0.79
    mong
    0.78
    claw
    0.76
    Act Density 0.007%

    No Known Activations