INDEX
Explanations
personal pronouns and verbs related to interactions and relationships
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
198
+0.11
0.3%
674
+0.11
0.3%
513
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1919
+0.11
0.06
862
+0.11
0.03
769
+0.08
0.04
Negative Logits
Embeddable
-0.52
ModelMap
-0.44
getFloat
-0.42
Holstein
-0.41
Maestro
-0.41
CultureInfo
-0.40
dépens
-0.40
Explor
-0.40
RequestMethod
-0.39
getActivity
-0.39
POSITIVE LOGITS
jetta
0.78
chrysler
0.77
shayari
0.74
mondeo
0.72
pajero
0.69
voleva
0.66
regardant
0.65
camry
0.64
scrat
0.64
hilux
0.63
Activations Density 0.218%