INDEX
Explanations
phrases related to hub or hubris
references to various hubs or central points in a context
New Auto-Interp
Negative Logits
terday
-0.69
izations
-0.68
anguage
-0.67
graded
-0.67
Grave
-0.66
ists
-0.65
IGHTS
-0.65
Jackets
-0.65
izable
-0.64
Chatt
-0.62
POSITIVE LOGITS
bub
1.48
bard
1.33
ris
1.08
lins
1.04
hub
1.02
staff
1.02
bies
1.02
lot
0.98
Hub
0.97
bie
0.95
Activations Density 0.027%