INDEX
Explanations
mentions of resources or the need for resources in contexts such as technology, healthcare/counseling, and economic/political situations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1870
+0.14
0.5%
1763
+0.12
0.4%
1472
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
990
+0.14
0.02
1140
+0.12
0.02
1763
+0.11
0.02
Negative Logits
aboli
-0.57
SneakyThrows
-0.55
felipe
-0.54
salvare
-0.53
comis
-0.51
Siria
-0.51
SUDOC
-0.51
Valentín
-0.51
josé
-0.50
invita
-0.50
POSITIVE LOGITS
resources
1.42
resource
1.38
Resources
1.30
Resource
1.23
resources
1.22
RESOURCES
1.21
RESOURCE
1.20
resource
1.19
Resources
1.16
Resource
1.12
Activations Density 0.074%