INDEX
Explanations
the word "Israel" and related phrases
mentions of the country Israel
New Auto-Interp
Negative Logits
ĸļ
-0.76
nant
-0.75
luaj
-0.74
ttes
-0.74
iddler
-0.73
utic
-0.73
mson
-0.70
apple
-0.68
Jackets
-0.67
geoning
-0.64
POSITIVE LOGITS
anyahu
0.90
Aviv
0.90
ites
0.85
aretz
0.79
ifa
0.76
Israel
0.75
apolis
0.74
Israel
0.71
usalem
0.68
Netanyahu
0.68
Activations Density 0.025%