INDEX
    Explanations

    references to sources or citations

    references to specific sources or citations

    New Auto-Interp
    Negative Logits
     whiff
    -0.70
    Interstitial
    -0.69
    uliffe
    -0.68
    ################
    -0.68
     deed
    -0.66
     Opera
    -0.65
    daq
    -0.65
    Frameworks
    -0.65
     Pebble
    -0.64
     heights
    -0.63
    POSITIVE LOGITS
    eree
    1.30
    erences
    1.24
    inement
    1.21
    actor
    1.13
    riger
    1.12
    erent
    1.11
    ractive
    1.10
    eren
    1.10
    erential
    1.09
    ined
    1.08
    Act Density 0.010%

    No Known Activations