INDEX
Explanations
proper nouns related to political figures and government positions
references to significant political figures and their roles
New Auto-Interp
Negative Logits
âĸ¬
-0.76
VID
-0.73
LOAD
-0.71
NAT
-0.69
ISP
-0.64
VID
-0.63
Sensor
-0.63
reproduction
-0.60
Ow
-0.60
frame
-0.60
POSITIVE LOGITS
Tillerson
1.17
Jinping
0.86
ython
0.81
ongyang
0.80
eln
0.80
uchin
0.79
DeVos
0.79
memos
0.78
utations
0.77
erella
0.77
Activations Density 0.022%