INDEX
Explanations
expressions of honesty or frankness in opinions
New Auto-Interp
Negative Logits
aguya
-0.71
GEBURTS
-0.65
ódź
-0.64
ouncil
-0.63
pach
-0.62
americas
-0.61
:✨
-0.60
lemmer
-0.60
=’
-0.59
archiviato
-0.59
POSITIVE LOGITS
honestly
1.00
frankly
0.97
Honestly
0.94
Frankly
0.93
Honestly
0.90
honestly
0.84
tbh
0.70
honest
0.70
ScopeManager
0.69
honn
0.68
Activations Density 0.099%