INDEX
Explanations
instances of the word "honest"
phrases related to honesty and candidness
New Auto-Interp
Negative Logits
Topic
-0.72
idelines
-0.68
main
-0.68
atis
-0.65
strand
-0.64
assemblies
-0.63
cellaneous
-0.62
Included
-0.62
plank
-0.61
RESULTS
-0.60
POSITIVE LOGITS
honestly
0.72
shame
0.65
Valiant
0.65
numb
0.63
ockey
0.63
nobody
0.63
cest
0.62
umber
0.62
couldn
0.61
cientious
0.60
Activations Density 0.163%