INDEX
Explanations
information related to animals, animal rights, and animal welfare
Neuron Alignment
Index   Value   % of L₁
1222    +0.15   0.6%
920     +0.14   0.5%
313     +0.12   0.4%
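The "% of L₁" figures in the Neuron Alignment table can be read as each neuron's share of the direction vector's total absolute weight. The sketch below illustrates that reading under stated assumptions: the function name `neuron_alignment`, its parameter `k`, and the toy vector are hypothetical, and the source does not confirm that the dashboard computes the column this way.

```python
def neuron_alignment(direction, k=3):
    """Return the k largest-magnitude entries as (index, value, percent_of_l1).

    Assumption: "% of L1" means |value| divided by the L1 norm of the
    whole direction vector, expressed as a percentage.
    """
    l1 = sum(abs(v) for v in direction)
    if l1 == 0:
        return []  # an all-zero vector has no meaningful alignment
    # Rank neuron indices by absolute weight, largest first.
    ranked = sorted(range(len(direction)),
                    key=lambda i: abs(direction[i]),
                    reverse=True)
    return [(i, direction[i], 100.0 * abs(direction[i]) / l1)
            for i in ranked[:k]]


# Toy vector (hypothetical data, not taken from the dashboard).
toy = [0.5, -0.25, 0.125, 0.125]
rows = neuron_alignment(toy, k=2)
for index, value, pct in rows:
    print(f"{index}\t{value:+.2f}\t{pct:.1f}%")
```

Under this reading, a top share as small as 0.6% would indicate that the direction is spread across many neurons rather than concentrated in a few.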
Correlated Neurons
Index   Pearson Corr.   Cosine Sim.
920     +0.15           0.03
1222    +0.14           0.03
421     +0.12           0.03
Negative Logits
rendono    -0.49
moreno     -0.49
cupa       -0.49
covite     -0.44
ícil       -0.44
habet      -0.43
Życiorys   -0.43
SEDS       -0.43
prada      -0.41
ureka      -0.41
Positive Logits
animal    1.19
animals   1.11
animal    1.11
Animal    1.07
Animal    1.06
Animals   1.03
ANIMAL    0.97
animals   0.96
Animals   0.96
ANIMAL    0.89
Activations Density 0.095%