INDEX
Explanations
The neuron activates on the word “fitness” in software-license disclaimer phrases (as in “fitness for a particular purpose”).
Negative Logits
BU        -0.07
NE        -0.06
perial    -0.06
جو        -0.06
Maybe     -0.06
_packet   -0.06
画        -0.06
主任      -0.06
_que      -0.06
RTE       -0.06
Positive Logits
živ          0.07
/icons       0.07
Cities       0.06
Комп         0.06
aney         0.06
FITNESS      0.06
everywhere   0.06
kaç          0.06
تكييف        0.06
Joe          0.06
Activations Density: 0.000%