INDEX
    Explanations

    sentences that summarize the essence or conclusion of a discussion

    New Auto-Interp
    Negative Logits
    ine
    -0.15
    uri
    -0.14
    rol
    -0.13
    æīįèĥ½
    -0.13
    600
    -0.13
    jes
    -0.13
     Nel
    -0.13
    ario
    -0.13
    reso
    -0.13
    reen
    -0.13
    POSITIVE LOGITS
     extent
    0.26
     extents
    0.20
    extent
    0.19
     ëģĿ
    0.19
    Extent
    0.18
     DONE
    0.17
    erdem
    0.17
    -all
    0.17
    .all
    0.17
     about
    0.16
    Act Density 0.041%

    No Known Activations