INDEX
Explanations
terms related to the environment, specifically bush and forest-related words
references to "bush" and its related contexts, along with occasional mentions of "womb" and "foss."
New Auto-Interp
Negative Logits
Hamburg
-0.67
ructose
-0.67
Else
-0.66
earable
-0.64
ications
-0.63
insidious
-0.63
cussion
-0.63
*/(
-0.62
oppable
-0.62
oric
-0.61
POSITIVE LOGITS
fires
1.10
bush
1.04
craft
1.03
wh
0.90
fire
0.88
meat
0.87
tree
0.87
mere
0.86
ido
0.82
pole
0.80
Activations Density 0.006%