INDEX
Explanations
concepts related to ideas and their development
New Auto-Interp
Negative Logits
ĶåĽŀ
-0.15
iens
-0.15
oeff
-0.15
ssp
-0.15
_processors
-0.15
pras
-0.15
оÑĢоÑĤ
-0.14
ãĥ³ãĥĩãĤ£
-0.14
ytt
-0.14
evenodd
-0.14
POSITIVE LOGITS
idea
0.59
idea
0.52
Idea
0.49
concept
0.49
ideas
0.39
concept
0.38
Concept
0.36
ideas
0.35
concepts
0.33
Ideas
0.32
Activations Density 0.277%