INDEX
Explanations
references to Osama bin Laden
Negative Logits
Token       Logit
hire        -0.73
WD          -0.71
eries       -0.70
••          -0.68
────────    -0.68
Drawn       -0.67
CAP         -0.66
place       -0.65
Concord     -0.65
haw         -0.65
Positive Logits
Token       Logit
bin         1.22
Osama       1.12
Bin         1.04
rall        1.01
Laden       0.96
Hussein     0.88
abad        0.82
Mubarak     0.81
atis        0.81
Tayyip      0.80
Activations Density 0.004%