INDEX
Explanations
particular order
The neuron detects list‐ordering disclaimers, i.e. the phrase “no particular order.”
New Auto-Interp
Negative Logits
ентів
-0.07
champion
-0.07
conce
-0.06
Ap
-0.06
hôn
-0.06
考
-0.06
.ToLower
-0.06
ompiler
-0.06
мож
-0.06
Ent
-0.06
POSITIVE LOGITS
Geographic
0.07
UIG
0.07
demographic
0.06
establishments
0.06
intra
0.06
)__
0.06
хорош
0.06
.invoke
0.06
Firm
0.06
зи
0.06
Activations Density 0.001%