INDEX
Explanations
It appears that Neuron 4 did not activate for any of the words provided in the document segments; therefore, no pattern of interest can be identified based on the given information
New Auto-Interp
Negative Logits
Pitt
-0.71
————————————————
-0.65
Worcester
-0.65
fall
-0.64
Hallow
-0.62
Gaza
-0.60
violent
-0.60
00000000
-0.59
Buff
-0.59
Mü
-0.58
POSITIVE LOGITS
ibaba
0.96
llular
0.78
llah
0.75
orney
0.74
kinson
0.73
ynthesis
0.73
inki
0.72
ilton
0.67
velength
0.66
oya
0.66
Activations Density 0.000%
No Known Activations
This feature has no known activations.