INDEX
Explanations
references to the term "Moore"
the recurring mention of the name "Moore."
New Auto-Interp
Negative Logits
ERAL
-0.84
ropolitan
-0.82
ICAN
-0.73
ivity
-0.73
æĺ¯
-0.68
Spanish
-0.66
===
-0.66
rious
-0.66
Hungarian
-0.66
lift
-0.65
POSITIVE LOGITS
Moore
1.15
Moore
0.91
stown
0.79
oshenko
0.75
ufact
0.75
cale
0.75
acre
0.74
zilla
0.74
sey
0.73
cloth
0.73
Activations Density 0.006%