INDEX
Explanations
references to induction into the Hall of Fame
mentions of the Hall of Fame
Negative Logits
eleph       -0.76
Gork        -0.70
oppos       -0.65
ften        -0.63
uyomi       -0.61
ropolitan   -0.61
itars       -0.60
ting        -0.59
lightly     -0.59
ted         -0.58
Positive Logits
iday        1.38
Hall        1.13
aday        1.07
ibur        1.05
Hall        1.05
oran        1.00
gren        1.00
hall        0.90
way         0.89
ows         0.88
Activations Density 0.008%