INDEX
    Explanations

    references to the name "Peter."

    New Auto-Interp
    Negative Logits
    }]
    
    -0.99
    ————————————————
    -0.92
    "])
    
    -0.77
     }))
    -0.77
     برانيه
    -0.76
     gainera
    -0.74
    kull
    -0.73
     Schwe
    -0.73
     weft
    -0.73
    orthand
    -0.73
    POSITIVE LOGITS
     Iq
    0.96
    Cordialement
    0.94
    er
    0.91
     Argos
    0.84
     iq
    0.81
    ed
    0.80
     monkey
    0.78
    Iq
    0.77
    0.77
     näm
    0.77
    Act Density 0.538%

    No Known Activations