INDEX
    Explanations

    discussions about honesty and truthfulness in various contexts

    NEGATIVE LOGITS
    Token               Logit
    "tagHelper"         -0.80
    " Paglinawan"       -0.71
    "expandindo"        -0.71
    "#+#"               -0.70
    "Kaynakça"          -0.68
    " Италијани"        -0.68
    "complexContent"    -0.66
    "adaptiveStyles"    -0.62
    ":+:"               -0.62
    "vscode"            -0.62
    POSITIVE LOGITS
    Token               Logit
    " truth"            2.07
    " honesty"          1.95
    " honest"           1.89
    "truth"             1.79
    " truthful"         1.71
    "Truth"             1.71
    " Truth"            1.69
    " TRUTH"            1.67
    "honest"            1.60
    " truths"           1.54
    Activation Density: 0.388%

    No Known Activations
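
For readers unfamiliar with the density figure above, the following is a minimal sketch of how such a number could be computed. It assumes density means the percentage of tokens on which the feature's activation exceeds a threshold; that definition is an assumption, since the page does not state it. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and do not come from this page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Made-up sample: 1000 tokens, of which 5 have a nonzero activation.
sample = [0.0] * 995 + [1.2, 0.4, 3.1, 0.9, 2.2]
print(f"{activation_density(sample):.3f}%")  # prints 0.500%
```

Under this reading, a density of 0.388% would mean the feature fires on roughly 4 of every 1000 tokens, which fits a feature tied to a narrow topic such as honesty and truthfulness.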