INDEX
    Explanations

    concepts related to ideas and their development

    New Auto-Interp
    Negative Logits
    ĶåĽŀ
    -0.15
    iens
    -0.15
    oeff
    -0.15
    ssp
    -0.15
    _processors
    -0.15
    pras
    -0.15
    оÑĢоÑĤ
    -0.14
    ãĥ³ãĥĩãĤ£
    -0.14
    ytt
    -0.14
    evenodd
    -0.14
    POSITIVE LOGITS
     idea
    0.59
    idea
    0.52
     Idea
    0.49
     concept
    0.49
     ideas
    0.39
    concept
    0.38
     Concept
    0.36
    ideas
    0.35
     concepts
    0.33
     Ideas
    0.32
    Act Density 0.277%

    No Known Activations