INDEX
Explanations
references to the word "Earth" and its variations
New Auto-Interp
Negative Logits
erglass
-0.17
kus
-0.16
ĵ¨
-0.15
egin
-0.15
kop
-0.15
ãĥ©ãĥĥãĤ¯
-0.15
sters
-0.14
_AI
-0.14
uses
-0.14
anders
-0.14
POSITIVE LOGITS
quake
0.38
worm
0.32
bound
0.31
lings
0.30
ling
0.27
qu
0.26
moving
0.23
ly
0.22
quake
0.21
lier
0.21
Activations Density 0.018%