INDEX
Explanations
The neuron detects words and phrases that indicate the correction or adjustment of measurements or data.
Negative Logits
니스        -0.07
IXEL      -0.06
arem      -0.06
Recipes   -0.06
/company  -0.06
ammer     -0.06
anno      -0.06
сяг       -0.06
私は      -0.06
anye      -0.05
Positive Logits
163       0.07
الدولة    0.07
TED       0.07
055       0.07
698       0.06
home      0.06
927       0.06
факт      0.06
esac      0.06
402       0.06
Activations Density 0.026%
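
As a rough illustration of the density figure above, the sketch below assumes it is the fraction of sampled token positions on which the neuron's activation is above zero. That definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions made for illustration; they are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumed definition: density = (# positions with activation > threshold) / (# positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 3 of which activate the neuron.
sample = [0.0] * 9997 + [1.2, 0.4, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.030%
```

Under this reading, a density of 0.026% would mean the neuron fires on roughly 26 of every 100,000 sampled tokens, which makes it a sparse unit.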