Explanations
This neuron mainly finds email addresses or references to a specific online platform with the word "hive."
Variations of the word "live."
Negative Logits
acea          -0.79
erva          -0.78
atural        -0.78
committee     -0.76
akuya         -0.75
adr           -0.72
swick         -0.71
é¾įå          -0.71
ologically    -0.70
é¾            -0.69
Positive Logits
llo           0.87
ll            0.77
reth          0.77
rics          0.76
lled          0.73
rers          0.71
ttes          0.70
rer           0.69
pods          0.69
lli           0.68
Activations Density 0.016%