Explanations
references to transforming into different objects or characters
references to imaginative or absurd scenarios
Negative Logits
Token        Logit
Initial      -0.87
imester      -0.86
gression     -0.85
Recommend    -0.85
Register     -0.83
autions      -0.82
Media        -0.82
mediate      -0.82
Policy       -0.81
requent      -0.80
Positive Logits
Token        Logit
dinosaurs    1.35
pengu        1.30
Godzilla     1.30
dolphins     1.29
Bigfoot      1.26
sharks       1.23
UFOs         1.22
gorilla      1.22
dinosaur     1.22
Dracula      1.20
Activations Density: 0.725%