INDEX
Explanations
words related to the concept of leadership or taking charge
New Auto-Interp
Negative Logits
اÙĤÙĦ
-0.14
Haj
-0.13
że
-0.13
rylic
-0.13
loth
-0.13
addCriterion
-0.13
ronym
-0.13
igan
-0.13
ucci
-0.12
rollable
-0.12
POSITIVE LOGITS
head
1.26
Head
1.18
head
1.15
heads
1.13
Head
1.10
-head
1.10
HEAD
1.04
heads
1.02
头
1.00
HEAD
0.98
Activations Density 0.372%