Explanations
mentions of fashion, clothing, and social issues related to wealth disparity
Negative Logits
både      -0.15
oute      -0.15
ainty     -0.14
оу        -0.14
.ms       -0.14
ota       -0.14
MORE      -0.14
yro       -0.14
esy       -0.14
sogar     -0.14
Positive Logits
nothing         0.18
nothing         0.18
occasional      0.17
immediate       0.15
thôi            0.14
SizeMode        0.14
Nothing         0.14
occasionally    0.14
ンチ            0.14
Nothing         0.14
Activations Density 0.169%