INDEX
Explanations
topics related to government, politics, or military intelligence
details regarding government or legal documents and significant program information
Negative Logits
erential      -0.61
etitive       -0.56
cautiously    -0.55
spective      -0.55
spect         -0.54
astical       -0.53
xtap          -0.53
earcher       -0.53
prompted      -0.52
imb           -0.52
Positive Logits
goddamn       0.91
fucking       0.87
fuckin        0.86
shit          0.85
crap          0.85
..."          0.81
gonna         0.80
shitty        0.80
FUCK          0.79
damn          0.79
Activations Density 1.732%