INDEX
    Explanations

    references to the concept of adaptation

    New Auto-Interp
    Negative Logits
    unk
    -0.44
    illion
    -0.42
    #
    -0.41
    <u>
    -0.41
    
    -0.39
    getNumber
    -0.39
    <b>
    -0.34
    <sup>
    -0.34
    Newswire
    -0.34
     ràng
    -0.34
    POSITIVE LOGITS
     adapt
    1.70
     adaptation
    1.66
    adapt
    1.59
     adapted
    1.58
     adapts
    1.56
    Adaptation
    1.55
    adaptation
    1.55
     Adaptation
    1.54
     Adapt
    1.53
     adapting
    1.52
    Act Density 0.189%

    No Known Activations