INDEX
    Explanations

    citations and references to source material

    New Auto-Interp
    Negative Logits
    orough
    -0.15
    tz
    -0.15
     Manus
    -0.14
     Wit
    -0.14
    cripts
    -0.14
    éĩ
    -0.14
    ÏģοÏį
    -0.14
    599
    -0.14
    æĽ
    -0.13
     afford
    -0.13
    POSITIVE LOGITS
     adapted
    0.40
    adapt
    0.37
     courtesy
    0.37
     Adapt
    0.36
    taken
    0.33
     taken
    0.30
    (Source
    0.30
     source
    0.29
     Source
    0.28
     adaptation
    0.28
    Act Density 0.269%

    No Known Activations