INDEX
Explanations
instances of the word "honest" and related concepts indicating sincerity and transparency
honest / honestly
New Auto-Interp
Negative Logits
ppuden
-0.48
Mayfield
-0.45
DataItem
-0.45
Garuda
-0.45
Barrington
-0.45
Raiders
-0.44
Baran
-0.44
Raider
-0.44
Barry
-0.44
ViewGroup
-0.43
POSITIVE LOGITS
Honest
0.94
Honest
0.93
honest
0.90
honest
0.81
Honesty
0.80
honnête
0.79
honesty
0.71
hones
0.66
dishonest
0.65
<bos>
0.63
Activations Density 0.006%