INDEX
Explanations
references to Hong Kong and related geographic or cultural terms
New Auto-Interp
Negative Logits
meneu
-0.69
actable
-0.57
אל
-0.56
aen
-0.55
Rüyada
-0.54
Tribes
-0.52
Myles
-0.52
メン
-0.52
cle
-0.52
cillas
-0.52
POSITIVE LOGITS
Hongkong
1.52
Kong
1.43
Kong
1.31
KONG
1.30
Hong
1.10
kong
1.06
HK
1.05
kong
1.05
香港
1.05
Macao
0.94
Activations Density 0.010%