INDEX
    Explanations

    occurrences of the word "the"

    New Auto-Interp
    Negative Logits
    ameda
    -0.16
     bò
    -0.15
    +-+-+-+-+-+-+-+-
    -0.14
    urances
    -0.14
    yssey
    -0.14
    isher
    -0.14
     Loren
    -0.14
    utron
    -0.13
    ulus
    -0.13
     ustanov
    -0.13
    POSITIVE LOGITS
     world
    0.29
     planet
    0.29
     industry
    0.24
     history
    0.23
     universe
    0.23
     country
    0.22
     hemisphere
    0.22
    world
    0.21
     globe
    0.21
     entire
    0.21
    Act Density 0.041%

    No Known Activations