INDEX
Explanations
phrases containing the word "Hill" followed by a number or description
references to "The Hill," a political news website and publication
Negative Logits
Token          Logit
◼              -0.83
============   -0.71
••••           -0.68
cial           -0.68
========       -0.67
Slot           -0.67
Export         -0.65
Emir           -0.64
Wan            -0.64
Marketable     -0.64
Positive Logits
Token    Logit
iard     1.12
Hill     1.06
stones   0.92
side     0.92
dale     0.88
boro     0.87
castle   0.87
hill     0.85
mont     0.83
stone    0.83
Activations Density 0.011%