INDEX
    Explanations

    phrases related to evidence or confirmation

    the term "proven" in various contexts related to validation or evidence

    New Auto-Interp
    Negative Logits
    adish
    -0.75
    letal
    -0.72
    ifle
    -0.72
    idays
    -0.69
    iewicz
    -0.67
     squats
    -0.66
    eeper
    -0.64
    umbn
    -0.64
    paio
    -0.64
    onductor
    -0.62
    POSITIVE LOGITS
    ãĥ¼ãĥĨ
    0.97
     proven
    0.92
    iary
    0.84
    س
    0.80
    ãĤ¤ãĥĪ
    0.78
     refuted
    0.78
    ingen
    0.78
    Ô
    0.75
     debunked
    0.75
    \\\\\\\\
    0.74
    Act Density 0.015%

    No Known Activations