INDEX
Explanations
It seems like neuron 4 is looking for phrases related to rocky terrain or landscapes
the word "ooth" and its variations, indicating a focus on specific phonetic patterns
New Auto-Interp
Negative Logits
derivatives
-0.68
risk
-0.66
list
-0.66
Luk
-0.64
perish
-0.64
lib
-0.64
Reuters
-0.62
search
-0.62
Copyright
-0.62
author
-0.62
POSITIVE LOGITS
ooth
4.68
oother
1.42
ranch
1.22
zee
1.20
oot
1.10
reet
1.01
owl
0.99
ridges
0.99
achine
0.97
oke
0.95
Activations Density 0.023%