INDEX
Explanations
names of people and places
names of people, especially those related to significant events or contributions
New Auto-Interp
Negative Logits
ctica
-0.75
oday
-0.72
rium
-0.71
hya
-0.71
rate
-0.71
ESH
-0.70
nces
-0.69
ration
-0.69
attery
-0.68
ONY
-0.68
POSITIVE LOGITS
ptr
0.75
pherd
0.74
mington
0.73
enburg
0.67
ipple
0.67
Shepherd
0.65
Materials
0.65
wcs
0.64
heck
0.63
Islands
0.61
Activations Density 0.020%