INDEX
Explanations
The main thing this neuron does is find occurrences of the word "barrage"
terms related to overwhelming attacks or assaults
the word 'barrage'
New Auto-Interp
Negative Logits
OTA
-0.93
uana
-0.80
cius
-0.72
ident
-0.70
occ
-0.69
basketball
-0.68
benef
-0.67
Utah
-0.66
Crime
-0.65
luck
-0.65
POSITIVE LOGITS
barrage
1.23
bombard
1.20
barr
1.11
inund
1.06
bombardment
1.05
pounded
0.93
onslaught
0.85
geoning
0.80
ãĥ£
0.79
raged
0.76
Activations Density 0.014%