INDEX
    Explanations

    mentions of nicknames and names

    New Auto-Interp
    Negative Logits
     }}"></
    -0.67
    )");
    
    -0.66
     ]
    
    -0.64
    "}>
    -0.62
    WithIOException
    -0.62
    "]}
    -0.61
     }</
    -0.60
    ]})
    -0.60
    "})
    -0.59
    "]];
    -0.59
    POSITIVE LOGITS
     nickname
    2.51
     name
    2.26
     nicknames
    2.14
     moniker
    2.00
     surname
    1.95
     names
    1.94
     renaming
    1.94
     NAME
    1.93
     rename
    1.93
     naming
    1.88
    Act Density 1.661%

    No Known Activations