INDEX
Explanations
This neuron detects mentions of monitoring or evaluating a deployed system’s performance.
New Auto-Interp
Negative Logits
raki
-0.07
.TryGetValue
-0.06
подум
-0.06
bigot
-0.06
husus
-0.06
[number
-0.06
墨
-0.06
upgrade
-0.06
uğra
-0.06
Fuse
-0.06
POSITIVE LOGITS
nutrient
0.07
cele
0.07
例
0.06
@@↵
0.06
alted
0.06
", ↵
0.06
Gaut
0.06
==↵
0.06
crossorigin
0.06
VENT
0.06
Activations Density 0.058%