INDEX
Explanations
words related to honesty or openness
expressions related to honesty and openness in communication
New Auto-Interp
Negative Logits
Klu
-0.77
Ĥ¬
-0.76
PUT
-0.75
ammy
-0.69
ibur
-0.69
iggs
-0.68
IELD
-0.68
LV
-0.66
ĪĴ
-0.66
©¶æ
-0.66
POSITIVE LOGITS
candid
1.25
acies
1.07
ature
0.98
ness
0.85
iator
0.80
frank
0.79
furt
0.75
ly
0.75
alty
0.74
atures
0.73
Activations Density 0.009%