INDEX
    Explanations

    the presence of the word "create" in various forms and contexts

    Negative Logits
     '          -0.55
     und        -0.54
     For        -0.51
     and        -0.51
     sel        -0.48
     likely     -0.48
    ,           -0.46
     in         -0.45
    elf         -0.45
    plained     -0.44
    Positive Logits
    create      2.02
    CREATE      1.03
     creation   1.01
     متعلقه     0.94
    creates     0.94
    creation    0.93
     create     0.92
    creat       0.92
     creazione  0.92
    Create      0.90
    Activation Density: 0.083%

    No Known Activations