INDEX
Explanations
specific words related to a "pub"
references to pubs
Negative Logits
Token         Logit
OHN           -0.80
llor          -0.75
IRD           -0.72
Marketable    -0.72
Gaia          -0.72
ioxide        -0.69
GOODMAN       -0.66
Archangel     -0.66
Reson         -0.66
EFF           -0.65
Positive Logits
Token         Logit
lique         1.35
lishes        1.33
lish          1.26
lisher        1.22
escent        1.21
lishing       1.20
bing          0.98
lik           0.97
crawl         0.91
ishes         0.90
Activations Density: 0.014%