INDEX
    Explanations

    locations and proximity references in the text

    Head Attr Weights
    Head  Weight
    0     0.10
    1     0.02
    2     0.09
    3     0.10
    4     0.02
    5     0.11
    6     0.04
    7     0.07
    8     0.15
    9     0.06
    10    0.11
    11    0.06
    Negative Logits
    " adaptation"  -1.10
    " Niet"        -1.06
    " POV"         -1.06
    " rewriting"   -1.03
    " subtitles"   -1.01
    " Weird"       -1.00
    " Spock"       -1.00
    " adapt"       -0.99
    " Canaver"     -0.98
    " behav"       -0.98
    Positive Logits
    "avia"     1.28
    "arma"     1.28
    "una"      1.23
    "waters"   1.23
    "sun"      1.21
    "istar"    1.20
    "union"    1.19
    "ropolis"  1.19
    "init"     1.18
    "hess"     1.17
    Act Density 0.039%

    No Known Activations