INDEX
    Explanations

    references to individuals named "Robert."

    "Robert" followed by names/surnames

    New Auto-Interp
    Negative Logits
     GenerationType
    -0.94
    ſelves
    -0.72
     purpoſe
    -0.69
     greateſt
    -0.68
     pleaſure
    -0.68
     Italijanski
    -0.67
    HasAnnotation
    -0.66
     reaſon
    -0.66
    ſelf
    -0.65
     fevere
    -0.64
    POSITIVE LOGITS
    bie
    0.67
    Manbalar
    0.53
    bi
    0.52
     Bob
    0.49
    ble
    0.49
    би
    0.47
    ました
    0.44
    Bob
    0.44
    0.44
    oward
    0.44
    Act Density 0.091%

    No Known Activations