INDEX
Explanations
phrases related to the terrorist Osama bin Laden
references to Osama bin Laden
New Auto-Interp
Negative Logits
eries
-0.76
haw
-0.74
hire
-0.73
drawn
-0.70
Drawn
-0.69
yards
-0.67
ships
-0.67
CAP
-0.67
âĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢ
-0.66
WD
-0.65
POSITIVE LOGITS
bin
1.20
Bin
1.07
Osama
0.99
Laden
0.92
Hussein
0.91
rall
0.87
Hassan
0.79
atis
0.79
Sharif
0.78
ibn
0.78
Activations Density 0.006%