INDEX
    Explanations

    references to truth and its various interpretations

    Negative Logits
    Token              Logit
    "ing"              -0.66
    "o"                -0.65
    "addPreferredGap"  -0.63
    "ené"              -0.58
    "ená"              -0.58
    "pdev"             -0.57
    " anjo"            -0.57
    "dyn"              -0.56
    "appName"          -0.56
    " Redmond"         -0.55
    Positive Logits
    Token              Logit
    " Truth"           1.37
    " TRUTH"           1.25
    "Truth"            1.23
    "truth"            1.22
    " truth"           1.16
    " truths"          1.15
    " Truths"          1.13
    "fulness"          0.91
    " Wahrheit"        0.90
    " Tahu"            0.89
    Activation Density: 0.005%

    No Known Activations