INDEX
Explanations
emotions expressed in personal experiences
Negative Logits
Token       Logit
himself     -0.17
妻          -0.17
leo         -0.15
ighton      -0.14
ząd         -0.14
Analyzer    -0.14
auen        -0.14
alim        -0.14
bsolute     -0.14
Unsigned    -0.14
Positive Logits
Token       Logit
丈夫        0.22
herself     0.21
сама        0.18
esh         0.15
Hao         0.15
pher        0.14
/div        0.14
Fav         0.14
publi       0.14
ová         0.14
Activations Density 2.524%