INDEX
    Explanations

    phrases discussing hypocrisy in various contexts

    New Auto-Interp
    Negative Logits
    .Global
    -0.16
     éĢļ
    -0.15
    yor
    -0.14
     Yol
    -0.14
    eldom
    -0.14
    à¥įद
    -0.13
     sno
    -0.13
     feas
    -0.13
    ,copy
    -0.13
    ÏĥÏĦα
    -0.13
    POSITIVE LOGITS
     ENTRY
    0.15
     gratuiti
    0.15
     Entry
    0.14
    -entry
    0.14
    =\"";↵
    0.14
    gateway
    0.14
    adder
    0.13
    quine
    0.13
    aise
    0.13
    iyon
    0.13
    Act Density 0.660%

    No Known Activations