INDEX
Explanations
locations and names with Japanese origin
references to political figures and governmental responsibilities in Japan
Negative Logits
uliffe      -0.96
BLM         -0.85
Lyons       -0.84
̶           -0.83
Philly      -0.82
Bray        -0.81
Colorado    -0.80
Dixon       -0.77
Malone      -0.76
espie       -0.76
Positive Logits
Japanese    1.91
Osaka       1.90
Japan       1.88
Japanese    1.87
Japan       1.86
Tokyo       1.86
Kyoto       1.81
Tok         1.80
Hiro        1.75
Fuji        1.75
Activations Density: 0.721%