INDEX
Explanations
terms related to law enforcement actions and their context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.32
1.2%
946
+0.14
0.5%
658
+0.10
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
946
+0.32
0.07
939
+0.14
0.07
658
+0.10
0.06
Negative Logits
<bos>
-2.94
//---
-0.62
/***
-0.61
truff
-0.59
//<
-0.56
harmonize
-0.54
<!--
-0.54
///**
-0.53
unify
-0.53
/*@
-0.52
POSITIVE LOGITS
Juf
0.94
Minang
0.94
Heeren
0.92
saar
0.91
Kün
0.87
tucson
0.86
Theile
0.86
Palembang
0.85
frankfurt
0.83
Heere
0.81
Activations Density 0.719%