INDEX
Explanations
references to pubs or public houses
mentions of pubs or similar establishments
New Auto-Interp
Negative Logits
Gaia
-0.72
OHN
-0.71
EFF
-0.69
llor
-0.69
IRD
-0.69
Archangel
-0.67
autom
-0.63
ylum
-0.63
xual
-0.61
Reson
-0.61
POSITIVE LOGITS
lish
1.43
lique
1.41
lishing
1.39
lishes
1.37
lisher
1.32
escent
1.21
bing
1.11
lik
1.03
bed
0.95
bish
0.94
Activations Density 0.029%