INDEX
Explanations
The main thing this neuron does is find names or references to the individual "Reynolds."
references to specific individuals or entities, particularly "Reynolds" and related terms
New Auto-Interp
Negative Logits
ulhu
-0.82
»Ĵ
-0.74
Bloom
-0.67
respectively
-0.65
çĶŁ
-0.65
acca
-0.63
inventoryQuantity
-0.62
lehem
-0.60
catentry
-0.60
ARI
-0.60
POSITIVE LOGITS
issance
1.10
itory
0.85
ndum
0.81
enegger
0.80
ption
0.77
earch
0.76
xual
0.75
irmation
0.73
kefeller
0.73
rocal
0.72
Activations Density 0.106%