INDEX
    Explanations

    phrases where the word "ORE" is mentioned along with a high activation value of 9 or 10

    New Auto-Interp
    Negative Logits
     sid
    -0.72
     angels
    -0.66
    st
    -0.64
     Bast
    -0.63
     Avalanche
    -0.62
     speeding
    -0.59
     Idlib
    -0.59
     Bec
    -0.58
    qu
    -0.58
     retrospect
    -0.58
    POSITIVE LOGITS
    ORE
    4.42
    ores
    2.18
    ore
    2.11
    ORED
    1.96
    ored
    1.49
    oring
    1.46
    OR
    1.39
    ORY
    1.37
    ALE
    1.35
    orer
    1.23
    Act Density 0.008%

    No Known Activations