INDEX
    Explanations

    starting phrase completions

    New Auto-Interp
    Negative Logits
    .Formatter
    -0.16
    ¶Į
    -0.15
    ¦æĥħ
    -0.14
    ÂĢÂĢ
    -0.13
    įng
    -0.13
     -*-č\n
    -0.12
    ******č\n
    -0.12
    łéϤ
    -0.11
    ıa
    -0.11
    .Dictionary
    -0.11
    POSITIVE LOGITS
     )\n\n\n\n\n\n\n\n
    0.09
     Cher
    0.08
    ."\n\n\n\n
    0.08
     basically
    0.08
     ​​
    0.08
    âĢ¢
    0.08
    /
    0.07
    -
    0.07
    's
    0.07
     '(
    0.07
    Act Density 0.003%

    No Known Activations