INDEX
Explanations
mentions of the location "Hong Kong"
mentions of Hong Kong
New Auto-Interp
Negative Logits
ally
-0.70
selection
-0.67
nesota
-0.66
placeholder
-0.65
kick
-0.64
Archdemon
-0.64
igm
-0.64
guided
-0.62
wedge
-0.62
unct
-0.61
POSITIVE LOGITS
Kong
1.35
awei
1.09
Hong
0.99
Hong
0.92
Hua
0.87
Zhu
0.82
Wong
0.80
moon
0.79
Tai
0.78
terness
0.77
Activations Density 0.007%