INDEX
Explanations
terms related to the immune system and immune responses
Neuron Alignment
Index    Value    % of L₁
156      +0.23    1.4%
47       +0.13    0.8%
225      +0.12    0.7%
Correlated Neurons
Index    P. Corr.    Cos Sim.
199      +0.23       0.01
446      +0.13       0.01
304      +0.12       0.01
Negative Logits
hots      -1.75
ables     -1.66
lla       -1.64
orrow     -1.61
aliana    -1.54
else      -1.51
afers     -1.47
ateur     -1.46
owing     -1.46
als       -1.46
Positive Logits
microenvironment    1.64
differently         1.64
system              1.54
compartment         1.54
responses           1.50
defenses            1.48
response            1.46
checkpoint          1.43
clearance           1.41
mediated            1.39
Activations Density 0.045%