Explanations
references to government and national security programs
Negative Logits
neau        -0.15
Loft        -0.15
Fav         -0.14
onia        -0.14
avia        -0.14
çħĻ         -0.14
amel        -0.14
egen        -0.13
olini       -0.13
ãĤ«ãĥĨ      -0.13
Positive Logits
DOE         0.26
Oak         0.21
.energy     0.21
Liver       0.20
LBL         0.19
Contractor  0.18
Energy      0.18
Oak         0.18
plut        0.17
.pnl        0.17
Activations Density 0.030%