INDEX
Explanations
references to the word "elephant" and related terms like "ivory" and "seals"
references to elephants and related topics
New Auto-Interp
Negative Logits
pring
-0.86
lly
-0.85
nder
-0.84
nergy
-0.82
ndra
-0.80
lished
-0.80
nces
-0.80
nda
-0.78
nerg
-0.76
mble
-0.74
POSITIVE LOGITS
elephant
1.32
elephants
1.29
iasis
1.15
Elephant
1.05
herds
0.97
calf
0.91
ivory
0.88
penis
0.88
Haram
0.86
ota
0.86
Activations Density 0.015%