INDEX
    Explanations

    sentences related to discussing the methodology or assumptions of logical reasoning and academic arguments

    New Auto-Interp
    Negative Logits
     Boot
    -0.64
     Inher
    -0.64
    dylib
    -0.63
     hail
    -0.61
     Strikes
    -0.60
     Jur
    -0.60
     Ital
    -0.60
    ebted
    -0.60
     Kut
    -0.59
     watches
    -0.59
    POSITIVE LOGITS
     preferable
    1.11
     counterproductive
    1.11
     fraught
    1.08
     advisable
    0.99
     frowned
    0.95
     futile
    0.95
     problematic
    0.94
     impractical
    0.94
     folly
    0.91
     daunting
    0.90
    Act Density 2.503%

    No Known Activations