INDEX
Explanations
expressions of personal struggle and resilience
New Auto-Interp
Negative Logits
telefon
-0.15
оÑģп
-0.15
eniable
-0.15
ucer
-0.14
chá»ĭu
-0.14
uido
-0.14
575
-0.14
strain
-0.14
iben
-0.14
evin
-0.13
POSITIVE LOGITS
others
0.25
Others
0.23
sharing
0.22
Others
0.21
others
0.20
Sharing
0.18
744
0.17
fellow
0.17
spread
0.16
useful
0.16
Activations Density 0.273%