Explanations
references to the struggle for resources and the concept of scarcity
Negative Logits
arel    -0.16
派      -0.15
gross   -0.15
psc     -0.15
ziel    -0.14
bote    -0.14
uled    -0.14
291     -0.14
anzi    -0.14
GT      -0.14
Positive Logits
sam      0.16
IRCLE    0.15
fork     0.15
Colbert  0.15
اÙ쨱     0.14
ưng      0.14
uro      0.14
simp     0.14
apsible  0.14
pitch    0.14
Activations Density 0.018%