INDEX
    Explanations

    references to specific items or topics, particularly those indicated with the definite article "the."

    New Auto-Interp
    Head Attr Weights
    0:0.02
    1:0.01
    2:0.11
    3:0.10
    4:0.22
    5:0.03
    6:0.13
    7:0.09
    8:0.04
    9:0.04
    10:0.06
    11:0.08
    Negative Logits
    MRI
    -1.50
    listed
    -1.50
    ounded
    -1.49
    allery
    -1.45
    ccording
    -1.43
     withd
    -1.39
    ocr
    -1.39
    livious
    -1.38
    エル
    -1.37
    ourage
    -1.35
    POSITIVE LOGITS
     advent
    1.76
     intervening
    1.61
     prevailing
    1.46
     uphe
    1.43
     Era
    1.42
     fact
    1.39
     raids
    1.35
     seasons
    1.34
    natureconservancy
    1.32
     onwards
    1.32
    Act Density 0.001%

    No Known Activations