INDEX
Explanations
specific descriptors related to physical appearance or characteristics
Negative Logits
osten       -0.17
ouz         -0.15
hong        -0.15
uppen       -0.14
bservice    -0.14
เพล         -0.14
嘉          -0.14
-readable   -0.14
icens       -0.14
ixer        -0.14
Positive Logits
less        0.23
-less       0.20
ted         0.20
ored        0.20
-bordered   0.20
-equipped   0.19
ged         0.18
bed         0.17
-haired     0.17
oured       0.17
Activations Density 0.143%