INDEX
Explanations
words related to people's names, particularly surnames
words related to people's names or surnames
New Auto-Interp
Negative Logits
Galileo
-0.72
Croatian
-0.67
Worlds
-0.65
Baal
-0.63
gradient
-0.63
[|
-0.61
Pastebin
-0.60
Eleven
-0.59
ibrary
-0.59
Metatron
-0.59
POSITIVE LOGITS
issance
0.82
ney
0.82
lish
0.82
zie
0.79
aghan
0.78
igan
0.76
agall
0.76
SHA
0.75
ulty
0.75
ynski
0.73
Activations Density 0.071%