INDEX
Explanations
references to the "Hall of Fame," likely related to sports or other distinguished achievements
references to the Hall of Fame
New Auto-Interp
Negative Logits
eleph
-0.73
oppos
-0.66
phrine
-0.65
ossibility
-0.65
ting
-0.61
nib
-0.60
Gork
-0.60
ted
-0.60
orporated
-0.60
itars
-0.59
POSITIVE LOGITS
iday
1.49
ibur
1.20
aday
1.14
gren
1.10
oran
1.09
ows
1.08
owed
1.05
igan
1.02
marks
1.02
way
0.99
Activations Density 0.010%