INDEX
Explanations
references to specific animals, such as dwarves, elephants, tigers, and monkeys
references to fantasy or mythical creatures, animals, and characters
New Auto-Interp
Negative Logits
nce
-0.70
aton
-0.69
549
-0.69
NC
-0.69
wise
-0.68
Asset
-0.68
IP
-0.67
ty
-0.67
lic
-0.66
York
-0.66
POSITIVE LOGITS
aurus
1.21
hip
1.17
ervatives
1.12
mith
1.06
ongs
1.04
uggest
1.01
terday
1.00
paces
1.00
agascar
0.99
ettings
0.97
Activations Density 0.096%