Explanations
terms related to endangered whales and their health status
Negative Logits
mund            -0.18
istrate         -0.17
lator           -0.15
klad            -0.15
award           -0.15
prite           -0.15
.PropertyType   -0.15
bach            -0.14
stride          -0.14
lemen           -0.14
Positive Logits
.pem            0.16
Lore            0.15
PLE             0.14
.family         0.14
ids             0.14
cased           0.14
-signed         0.14
BU              0.14
lein            0.14
pare            0.13
Activations Density: 0.028%