INDEX
Explanations
concepts related to evolution and design
This neuron appears to be activating on a highly diverse and seemingly unrelated set of tokens across different document types (philosophical discourse, Japanese media content, programming code, and economic articles), making it difficult to identify a single coherent pattern. However, examining the strongest activations reveals that the neu
New Auto-Interp
Negative Logits
出版年
-0.72
AssemblyTitle
-0.54
الدراسه
-0.48
Personendaten
-0.47
propOrder
-0.47
MIDDLEWARE
-0.47
stateProvider
-0.46
Життєпис
-0.45
Followers
-0.44
errorCode
-0.43
POSITIVE LOGITS
typing
0.42
Luc
0.41
ApiModelProperty
0.40
mobileqq
0.39
댓
0.37
evolve
0.36
Luc
0.36
evolves
0.35
typed
0.35
Typing
0.35
Activations Density 0.099%