INDEX
    Explanations

    references to the concept of truth and its implications in various contexts

    New Auto-Interp
    Negative Logits
    nonatomic
    -0.97
    Berikut
    -0.86
    AppMethodBeat
    -0.84
     gero
    -0.83
     cory
    -0.81
     Aras
    -0.80
     Garvey
    -0.79
     nahilalakip
    -0.79
     Gentry
    -0.76
    cenary
    -0.75
    POSITIVE LOGITS
    truth
    0.82
    lessly
    0.77
    esserung
    0.75
     lie
    0.74
     Truths
    0.74
     Truth
    0.74
     chalk
    0.73
     truths
    0.72
     lies
    0.72
     truth
    0.71
    Act Density 0.102%

    No Known Activations