Explanations
The neuron activates on occurrences of the word “valid.”
Negative Logits
Between         -0.07
compression     -0.07
Cluster         -0.07
house           -0.07
_course         -0.07
Incorrect       -0.07
maze            -0.07
responseObject  -0.06
Increment       -0.06
Floor           -0.06
Positive Logits
valid    0.11
Valid    0.11
salvage  0.08
good     0.07
(valid   0.07
fair     0.07
VALID    0.07
stands   0.07
solids   0.07
.Valid   0.07
Activations Density: 0.008%