INDEX
Explanations
words related to people's names, specifically the name "William" with a strong activation value
mentions of the name "William."
New Auto-Interp
Negative Logits
yrinth
-0.86
rador
-0.78
ebook
-0.75
wrapper
-0.74
chio
-0.74
displayText
-0.73
asio
-0.71
ADRA
-0.71
ordinate
-0.70
hess
-0.70
POSITIVE LOGITS
son
0.92
\\\\\\\\
0.91
sburg
0.87
Randolph
0.85
stown
0.85
Faul
0.83
Hague
0.83
Wallace
0.80
ette
0.80
William
0.80
Activations Density 0.017%