INDEX
    Explanations

    references to the concept of truth and its implications

    New Auto-Interp
    Negative Logits
     AssemblyCompany
    -0.57
    yarnpkg
    -0.56
    Tikang
    -0.56
    tagHelperRunner
    -0.56
    føre
    -0.56
     تضيفلها
    -0.55
    ViewFeatures
    -0.54
    naje
    -0.54
    oblotting
    -0.53
     وتسجيلات
    -0.52
    POSITIVE LOGITS
    fulness
    0.84
     truths
    0.78
     TRUTH
    0.76
     Truth
    0.74
     Truths
    0.71
     lies
    0.71
     truth
    0.65
     lie
    0.64
    Truth
    0.64
     Efq
    0.63
    Act Density 0.129%

    No Known Activations