INDEX
Explanations
references to a specific location, Fort Detrick, within various contexts
references to Detroit and its related entities
New Auto-Interp
Negative Logits
manship
-0.89
xual
-0.88
nown
-0.79
ammy
-0.76
hetti
-0.74
externalActionCode
-0.69
lihood
-0.66
#$
-0.65
heit
-0.62
deaf
-0.62
POSITIVE LOGITS
ector
1.18
ention
1.12
ected
1.12
rans
1.11
roit
1.06
ailed
1.03
ective
1.02
ainers
1.01
ainer
0.98
ection
0.97
Activations Density 0.014%