INDEX
    Explanations

    definite articles and their occurrences in context

    New Auto-Interp
    Negative Logits
     opportunity
    -0.18
     likes
    -0.17
     entire
    -0.16
     stuff
    -0.15
    atts
    -0.15
     avenue
    -0.15
     likeness
    -0.15
    人æīį
    -0.15
    äºĪ
    -0.14
     Entire
    -0.14
    POSITIVE LOGITS
     few
    0.44
    few
    0.36
     many
    0.33
     Few
    0.33
    Few
    0.31
    many
    0.30
    åĩłä¸ª
    0.27
     several
    0.25
     rare
    0.24
     MANY
    0.24
    Act Density 0.117%

    No Known Activations