INDEX
Explanations
This neuron detects mentions of “trade-offs” (or closely related hyphenated terms indicating balancing competing factors).
New Auto-Interp
Negative Logits
(ERROR
-0.07
Fowler
-0.06
JNIEnv
-0.06
//
-0.06
test
-0.06
CWE
-0.06
snippet
-0.06
occurrence
-0.06
_ORDER
-0.06
manner
-0.06
POSITIVE LOGITS
rng
0.07
Savaşı
0.07
олаг
0.07
Repositories
0.07
ри
0.07
TTC
0.06
сом
0.06
토
0.06
ژ
0.06
busc
0.06
Activations Density 0.005%