INDEX
    Explanations

    proper names, particularly the name "Rebecca 9"

    Negative Logits
    Token                Logit
    "<bos>"              -0.79
    (blank)              -0.67
    "ByVersion"          -0.63
    "<?"                 -0.61
    "SequentialGroup"    -0.55
    (blank)              -0.55
    "qiang"              -0.54
    " onSave"            -0.53
    "обеди"              -0.52
    "énégal"             -0.52

    Positive Logits
    Token                Logit
    " Rebecca"           1.09
    " véhic"             1.01
    "Rebecca"            1.00
    " kac"               0.97
    " silikon"           0.94
    " lele"              0.92
    " jaya"              0.91
    " affor"             0.91
    " saba"              0.91
    " aussitôt"          0.91

    Activation Density: 0.284%

    No Known Activations