INDEX
Explanations
instances of the word "layer" and assign high activation values to any numerical values associated with it
references to layers in various contexts
New Auto-Interp
Negative Logits
date
-0.71
ICES
-0.66
Predators
-0.64
orth
-0.64
Scand
-0.63
transform
-0.63
Latin
-0.63
speaking
-0.63
STAR
-0.62
opened
-0.61
POSITIVE LOGITS
layer
1.50
layers
1.46
layer
1.24
Layer
1.24
Layer
1.20
layered
0.86
ayers
0.84
thickness
0.83
coats
0.78
htaking
0.77
Activations Density 0.008%