INDEX
Explanations
animal-related words and locations, particularly zoos
mention of the word "Zoo" in various contexts
New Auto-Interp
Negative Logits
pring
-0.81
Interstitial
-0.79
acies
-0.74
aneous
-0.74
idate
-0.72
ÑĮ
-0.71
itimate
-0.70
iating
-0.70
owship
-0.69
rity
-0.69
POSITIVE LOGITS
Zoo
0.92
Tycoon
0.87
onga
0.87
biology
0.83
opia
0.82
elling
0.82
ÅĤ
0.81
ey
0.80
zilla
0.80
zoo
0.79
Activations Density 0.017%