INDEX
Explanations
the name "Netanyahu" as a proper noun
mentions of Benjamin Netanyahu
New Auto-Interp
Negative Logits
teenth
-0.87
Pyth
-0.85
Reviewer
-0.84
itialized
-0.77
anwhile
-0.76
ively
-0.76
ĸļ
-0.72
orative
-0.70
ibles
-0.68
ebin
-0.68
POSITIVE LOGITS
anyahu
1.21
Netanyahu
1.05
ministerial
0.88
Jinping
0.82
anca
0.81
stein
0.79
etz
0.78
bloc
0.76
itz
0.74
Aviv
0.73
Activations Density 0.010%