INDEX
Explanations
references to "House of" followed by various nouns or terms related to titles or names
New Auto-Interp
Negative Logits
zer
-0.15
iy
-0.15
lif
-0.14
ges
-0.14
yy
-0.14
ůsob
-0.14
Fresh
-0.14
chaft
-0.14
Fresh
-0.14
_lifetime
-0.14
POSITIVE LOGITS
House
0.22
house
0.17
-corner
0.16
Perc
0.15
HOUSE
0.15
ouse
0.15
å¹ķ
0.15
House
0.14
Representatives
0.14
.DESC
0.14
Activations Density 0.029%