INDEX
Explanations
references to quality literature or artistic works
New Auto-Interp
Negative Logits
bourg
-0.16
deaux
-0.16
Famous
-0.15
subroutine
-0.15
cone
-0.15
lobals
-0.14
zilla
-0.14
onis
-0.14
òng
-0.14
ancode
-0.13
POSITIVE LOGITS
gem
0.22
gem
0.20
Gem
0.19
rarity
0.18
winner
0.17
welcome
0.17
Gem
0.17
pleasure
0.16
force
0.16
feast
0.16
Activations Density 0.134%