INDEX
Explanations
articles and descriptors related to people and their attributes
New Auto-Interp
Negative Logits
swer
-0.16
άνÏĦα
-0.15
omu
-0.15
elere
-0.14
ctrine
-0.14
astle
-0.14
crire
-0.14
ãĥ³ãĥĢ
-0.14
wang
-0.13
.crm
-0.13
POSITIVE LOGITS
man
0.14
Platt
0.14
ระ
0.14
UNET
0.14
odore
0.13
usch
0.13
evi
0.13
opleft
0.13
quila
0.13
Hol
0.13
Activations Density 0.105%