INDEX
Explanations
proper nouns related to individuals named "Arthur"
references to the name "Arthur" in various contexts
Negative Logits
Token         Logit
ongyang       -0.86
ramid         -0.77
ple           -0.72
initialized   -0.72
ering         -0.71
arity         -0.71
plings        -0.70
artisan       -0.70
utation       -0.69
ered          -0.67
Positive Logits
Token         Logit
Ashe          1.13
Conan         1.06
Weasley       1.01
Pend          0.95
Arthur        0.84
andise        0.83
ufact         0.82
Guinness      0.80
ian           0.80
Andersen      0.80
Activations Density: 0.035%