Explanations
references to environmental issues and ecological consequences
Negative Logits
drown     -0.16
amphib    -0.16
alance    -0.16
ducks     -0.16
Ducks     -0.15
/dom      -0.15
orest     -0.15
rodents   -0.15
duck      -0.15
grese     -0.15
Positive Logits
coral     0.23
reefs     0.22
Coral     0.22
Reef      0.19
reef      0.19
çı        0.18
MOTE      0.17
çīĻ       0.16
atu       0.16
ç¤        0.15
Activations Density 0.057%