INDEX
Explanations
references to specific individuals or unique identifiers
Negative Logits
ạch       -0.15
ncy       -0.14
公共      -0.13
trí       -0.13
Duc       -0.13
amarin    -0.13
mí        -0.13
GPLv      -0.13
амп       -0.13
duke      -0.12
Positive Logits
Moon          0.36
Alpha         0.31
Moon          0.29
moon          0.26
Alpha         0.25
Voyager       0.25
ALPHA         0.25
astronauts    0.24
lunar         0.24
Apollo        0.23
Activations Density 0.005%