INDEX
    Explanations

    references to notable figures and their accomplishments

    New Auto-Interp
    Negative Logits
    ì¦
    -0.14
     thereby
    -0.14
    852
    -0.14
     or
    -0.13
    307
    -0.13
    749
    -0.13
    andbox
    -0.13
     ëͰëĿ¼
    -0.13
     thus
    -0.13
    éı
    -0.13
    POSITIVE LOGITS
     others
    0.51
    others
    0.40
     Others
    0.34
     etc
    0.32
     finally
    0.32
    Others
    0.31
    etc
    0.31
     countless
    0.27
     numerous
    0.25
    finally
    0.25
    Act Density 0.217%

    No Known Activations