INDEX
Explanations
references to trees in the text
references to trees
New Auto-Interp
Negative Logits
ersive
-0.78
glomer
-0.74
rontal
-0.74
arcer
-0.70
dL
-0.70
ensitive
-0.68
ombat
-0.67
oice
-0.66
Horowitz
-0.66
DOS
-0.63
POSITIVE LOGITS
canopy
1.18
frog
1.09
trees
1.05
stump
1.05
Hug
1.02
beard
1.00
planting
0.97
yard
0.96
tree
0.94
trunk
0.93
Activations Density 0.041%