INDEX
Explanations
references to the city of Beijing
mentions of Beijing
New Auto-Interp
Negative Logits
oway
-0.73
adoes
-0.73
osate
-0.72
RH
-0.72
Trooper
-0.69
pent
-0.68
âĢ¢âĢ¢âĢ¢âĢ¢
-0.68
ocene
-0.67
RANT
-0.67
Hitchcock
-0.65
POSITIVE LOGITS
ijing
1.17
Jinping
1.04
Beijing
1.00
Lumpur
0.99
Yuan
0.89
jing
0.88
zhou
0.86
jin
0.84
Jing
0.81
wei
0.81
Activations Density 0.012%