INDEX
Explanations
mentions of places, possibly related to British regions or universities, specifically "Ang-" followed by a numeric value
references to the Anglophone culture or societies
New Auto-Interp
Negative Logits
externalToEVAOnly
-0.88
ãĥ¼ãĥ³
-0.86
cloth
-0.76
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.76
igslist
-0.73
pmwiki
-0.71
wagen
-0.71
Ö¼
-0.70
ometimes
-0.68
ords
-0.67
POSITIVE LOGITS
lia
1.08
rily
1.05
sty
0.99
uish
0.95
emouth
0.92
lyn
0.91
Ang
0.90
Ang
0.88
rab
0.79
roup
0.79
Activations Density 0.014%