INDEX
    Explanations

    instances of the word "honest" and related concepts indicating sincerity and transparency

    New Auto-Interp
    Negative Logits
    ppuden
    -0.48
     Mayfield
    -0.45
    DataItem
    -0.45
     Garuda
    -0.45
     Barrington
    -0.45
     Raiders
    -0.44
     Baran
    -0.44
     Raider
    -0.44
     Barry
    -0.44
    ViewGroup
    -0.43
    POSITIVE LOGITS
     Honest
    0.94
    Honest
    0.93
    honest
    0.90
     honest
    0.81
     Honesty
    0.80
     honnête
    0.79
     honesty
    0.71
     hones
    0.66
     dishonest
    0.65
    <bos>
    0.63
    Act Density 0.006%

    No Known Activations