    Explanations

    elements related to academic discourse and structured arguments

    NEGATIVE LOGITS
    "ourcing"   -0.15
    " ISO"      -0.15
    " Slim"     -0.15
    " "         -0.15
    ".hr"       -0.15
    " wholes"   -0.14
    " Ran"      -0.14
    " sources"  -0.14
    ","         -0.14
    "lide"      -0.13
    POSITIVE LOGITS
    " extensions"    0.38
    " extension"     0.35
    "extension"      0.31
    "extensions"     0.31
    " Extensions"    0.30
    " continuation"  0.28
    "Extensions"     0.28
    " Extension"     0.27
    " derivative"    0.27
    "Extension"      0.26
    Act Density 0.237%

    No Known Activations