INDEX
Explanations
I'm sorry, I couldn't extract any particular pattern or theme from the activations in neuron 4 for this document
references to the substance 'esc' or variations of it, likely related to 'escapism'
Negative Logits
Surface    -0.71
女    -0.68
Kinect    -0.63
Archdemon    -0.63
WP    -0.62
Monthly    -0.62
à¨    -0.62
Lumia    -0.61
bitch    -0.59
Avery    -0.59
Positive Logits
aped    1.22
esc    1.19
apes    1.16
ribed    1.11
orts    1.11
ript    1.09
autions    1.07
ence    1.07
opes    1.04
apers    1.00
Activations Density 0.005%