INDEX
Explanations
the word "confident" in various contexts
expressions of trust or self-assurance
New Auto-Interp
Negative Logits
sites
-0.95
Vert
-0.77
ental
-0.76
pmwiki
-0.73
çĦ
-0.73
outed
-0.70
href
-0.68
apple
-0.68
adish
-0.67
ifling
-0.67
POSITIVE LOGITS
ially
0.93
confident
0.92
shorth
0.83
urances
0.81
assurance
0.81
assurances
0.81
enough
0.78
iated
0.78
challenger
0.77
confidence
0.77
Activations Density 0.012%