INDEX
    Explanations

    concepts related to evolution and design

    This neuron appears to activate on a highly diverse, seemingly unrelated set of tokens across different document types (philosophical discourse, Japanese media content, programming code, and economic articles), which makes it difficult to identify a single coherent pattern. However, examining the strongest activations reveals that the neu

    Negative Logits
    出版年    -0.72
     AssemblyTitle    -0.54
    الدراسه    -0.48
    Personendaten    -0.47
     propOrder    -0.47
    MIDDLEWARE    -0.47
    stateProvider    -0.46
    Життєпис    -0.45
    Followers    -0.44
     errorCode    -0.43
    Positive Logits
     typing    0.42
     Luc    0.41
    ApiModelProperty    0.40
    mobileqq    0.39
    0.37
     evolve    0.36
    Luc    0.36
     evolves    0.35
     typed    0.35
     Typing    0.35
    Act Density 0.099%

    No Known Activations