INDEX
    Explanations

    phrases related to summaries and information sharing

    followed by a list or summary

    Negative Logits
    Token               Logit
    "LookAnd"           -0.71
    "medriver"          -0.65
    " الرياضيه"         -0.65
    " ddelweddau"       -0.61
    "Історія"           -0.60
    "writeFieldEnd"     -0.56
    "glClear"           -0.56
    "urlpatterns"       -0.55
    "̍t"                 -0.53
    "Économie"          -0.51
    Positive Logits
    Token               Logit
    " some"             1.42
    " några"            1.11
    " algunos"          1.10
    " alguns"           1.09
    " algunas"          1.08
    " quelques"         1.08
    " algumas"          1.05
    "some"              1.04
    " briefly"          1.03
    "Some"              0.98
    Activation Density: 0.340%

    No Known Activations