INDEX
Explanations
references to or mentions of the country "Israel."
mentions of the country Israel
New Auto-Interp
Negative Logits
ĸļ
-0.84
nant
-0.77
geoning
-0.77
mson
-0.77
luaj
-0.71
rencies
-0.71
Jackets
-0.71
robat
-0.69
urations
-0.68
utic
-0.68
POSITIVE LOGITS
Aviv
0.88
ites
0.87
anyahu
0.86
Israel
0.86
aretz
0.81
Israel
0.81
apolis
0.79
ifa
0.76
Juda
0.75
Netanyahu
0.74
Activations Density 0.025%