    Explanations

    terms related to growth or increase in scale

    Negative Logits
    Token         Logit
    "fully"       -0.17
    "Æł"          -0.16
    "anners"      -0.16
    "ialized"     -0.16
    "lessly"      -0.16
    "zelf"        -0.15
    "erman"       -0.15
    "plash"       -0.15
    "utow"        -0.15
    "ourney"      -0.15
    Positive Logits
    Token         Logit
    " upon"       0.27
    " Upon"       0.21
    " into"       0.19
    "Upon"        0.18
    "/import"     0.17
    " hor"        0.17
    "able"        0.15
    "ary"         0.15
    "/general"    0.15
    "avier"       0.15
    Activation Density: 0.033%

    No Known Activations