INDEX
    Explanations

    true statements leading to contradictions

    New Auto-Interp
    Negative Logits
     потенциа
    0.47
    φέ
    0.46
    0.45
    element
    0.44
    ája
    0.44
    𝖗
    0.44
    का
    0.44
     Kald
    0.43
    готовка
    0.42
    0.42
    POSITIVE LOGITS
    inyin
    0.48
     spiritual
    0.47
     Spiritual
    0.44
    pgamma
    0.44
     sphing
    0.44
     preached
    0.43
     bilirubin
    0.43
     prophes
    0.43
     موسیقی
    0.43
    datatables
    0.42
    Act Density 0.006%

    No Known Activations