INDEX
Explanations
names of animals
names of animals or characters in titles
New Auto-Interp
Negative Logits
orship
-0.73
AFP
-0.71
ancies
-0.71
avail
-0.67
franc
-0.67
aloud
-0.66
isol
-0.66
impart
-0.66
icably
-0.65
ervation
-0.64
POSITIVE LOGITS
Tail
1.10
Squad
1.07
Nation
1.06
Tracks
1.06
Nation
1.04
Claw
1.02
Creek
1.02
Girl
1.01
Mania
0.98
Soup
0.97
Activations Density 0.135%