INDEX
Explanations
references to leadership qualities and professional expertise
New Auto-Interp
Negative Logits
âĪı
-0.19
labour
-0.17
cie
-0.17
connexion
-0.17
ulia
-0.17
Fortune
-0.16
avourite
-0.16
beaut
-0.15
honour
-0.15
honoured
-0.15
POSITIVE LOGITS
à¹Ĩ
0.17
Armor
0.15
pj
0.15
embedding
0.15
dementia
0.14
906
0.14
Armor
0.14
.Magenta
0.14
frag
0.14
Capability
0.14
Activations Density 0.152%