INDEX
Explanations
information related to wolves and their ecological importance
New Auto-Interp
Negative Logits
egg
-0.17
umble
-0.16
ponge
-0.15
Helmet
-0.15
Wheel
-0.14
addin
-0.14
frog
-0.14
urtle
-0.14
IRT
-0.14
èĽ
-0.14
POSITIVE LOGITS
wolf
0.65
wolves
0.61
Wolf
0.59
wol
0.57
wolf
0.54
Wolf
0.52
Wolves
0.51
çĭ¼
0.47
Wol
0.46
lup
0.45
Activations Density 0.075%