INDEX
Explanations
occurrences of the word "the"
New Auto-Interp
Negative Logits
ameda
-0.16
bò
-0.15
+-+-+-+-+-+-+-+-
-0.14
urances
-0.14
yssey
-0.14
isher
-0.14
Loren
-0.14
utron
-0.13
ulus
-0.13
ustanov
-0.13
POSITIVE LOGITS
world
0.29
planet
0.29
industry
0.24
history
0.23
universe
0.23
country
0.22
hemisphere
0.22
world
0.21
globe
0.21
entire
0.21
Activations Density 0.041%