Explanations
references to "crowds" or "crowd-related" terminology
Negative Logits
ugs        -0.16
Guys       -0.15
ockets     -0.14
nick       -0.14
asc        -0.14
onym       -0.14
Transport  -0.14
PN         -0.14
bron       -0.14
exc        -0.13
Positive Logits
dfunding   0.29
ther       0.24
Wing       0.22
wing       0.22
ning       0.21
thers      0.20
ded        0.20
Crow       0.20
crow       0.19
ding       0.19
Activations Density 0.010%