INDEX
Explanations
terms related to founding members and leadership roles
New Auto-Interp
Negative Logits
idge
-0.18
yz
-0.17
oline
-0.17
ç¹Ķ
-0.17
Foundation
-0.16
phalt
-0.15
ameleon
-0.15
å¢ĥ
-0.15
oking
-0.14
/notification
-0.14
POSITIVE LOGITS
fathers
0.27
Fathers
0.23
father
0.18
father
0.17
/original
0.17
ry
0.17
lay
0.17
Father
0.16
-member
0.16
lation
0.16
Activations Density 0.020%