INDEX
    Explanations

    instances where an action is noticed or observed

    articles and demonstratives

    New Auto-Interp
    Negative Logits
     Jagu
    -0.81
    grounds
    -0.80
    Contents
    -0.75
    Edit
    -0.70
     Amend
    -0.68
     observes
    -0.65
     ie
    -0.65
     assum
    -0.65
    align
    -0.64
    tests
    -0.64
    POSITIVE LOGITS
     few
    1.08
     lot
    1.06
     plethora
    1.05
     handful
    1.04
     glimpse
    1.03
     bunch
    1.01
     significant
    0.98
     multitude
    0.98
     huge
    0.98
     couple
    0.96
    Act Density 0.433%

    No Known Activations