INDEX
    Explanations

    mentions of the word "More" with emphasis

    references to increased quantities or amounts, often indicating an enhancement or additional content

    New Auto-Interp
    Negative Logits
    iku
    -0.71
    keye
    -0.61
    opian
    -0.59
    tein
    -0.59
    anka
    -0.58
    rained
    -0.58
    ogene
    -0.57
     equilibrium
    -0.57
    itude
    -0.56
    onite
    -0.55
    POSITIVE LOGITS
     More
    3.10
    More
    1.99
     Less
    1.71
    more
    1.67
     MORE
    1.66
     Few
    1.42
     Further
    1.41
     Much
    1.36
    MORE
    1.32
     Nearly
    1.30
    Act Density 0.015%

    No Known Activations