INDEX
Explanations
references to a specific place or organization, Fort Detrick
references to the word "Detroit."
New Auto-Interp
Negative Logits
xual
-0.87
nown
-0.77
hetti
-0.75
manship
-0.73
ammy
-0.70
drive
-0.70
toggle
-0.67
inventoryQuantity
-0.67
actionGroup
-0.67
perm
-0.66
POSITIVE LOGITS
ector
1.21
ective
1.18
ected
1.14
ention
1.06
ection
1.03
rans
0.99
ail
0.98
roit
0.97
ailed
0.96
ainer
0.96
Activations Density 0.018%