INDEX
Explanations
discussions about transparency and honesty in communication
Negative Logits
ledes         -0.57
Magee         -0.53
menschen      -0.49
autorelease   -0.48
Vella         -0.48
pisang        -0.47
rdı           -0.46
jurí          -0.45
Purdy         -0.45
vscode        -0.45
Positive Logits
disclosing    0.89
revealing     0.87
disclosure    0.86
disclose      0.83
transparency  0.82
disclosures   0.79
discloses     0.78
honesty       0.77
frank         0.76
openly        0.76
Activations Density: 0.228%