INDEX
    Explanations

    code that involves outputting messages or printing to the console

    Negative Logits
    "ipo"          -0.17
    "ue"           -0.15
    "ights"        -0.15
    " familiar"    -0.15
    "otre"         -0.15
    "et"           -0.15
    "arie"         -0.14
    "utor"         -0.14
    "922"          -0.14
    "camp"         -0.14
    Positive Logits
    ".println"     0.39
    "println"      0.26
    ".print"       0.22
    " println"     0.21
    "-print"       0.20
    " prints"      0.19
    ".Println"     0.18
    ".WriteLine"   0.18
    " print"       0.18
    "\tprintln"    0.18
    Act Density 0.008%

    No Known Activations