INDEX
    Explanations

    the name "Ryan" in the text

    New Auto-Interp
    Negative Logits
    <bos>
    -2.91
    -1.07
    
    
    -1.02
    <?
    -0.99
    /**
    -0.94
    /***
    
    -0.87
    <?
    
    -0.79
    /*
    -0.75
    ///**
    -0.66
    AutoScaleMode
    -0.65
    POSITIVE LOGITS
     Ryan
    1.29
    Ryan
    1.25
     ryan
    1.14
     cæ
    0.82
     Juillet
    0.81
     pank
    0.80
     saar
    0.79
    ryan
    0.79
     silikon
    0.79
     Bibl
    0.79
    Act Density 0.073%

    No Known Activations