INDEX
    Explanations

    references to knowledge and understanding

    New Auto-Interp
    Negative Logits
    \<^
    -0.17
    istol
    -0.16
    ulant
    -0.16
    az
    -0.15
    go
    -0.15
    ross
    -0.15
    /group
    -0.14
    otts
    -0.14
    bons
    -0.14
    shaw
    -0.14
    POSITIVE LOGITS
    ably
    0.23
    fulness
    0.21
    fully
    0.19
    ledged
    0.18
    gable
    0.17
    285
    0.17
     about
    0.16
    ously
    0.16
     base
    0.16
    igne
    0.15
    Act Density 0.029%

    No Known Activations