INDEX
Explanations
mentions of bipartisan support or unity
references to specific sections or appendices in documents
Negative Logits
maxwell        -0.72
berra          -0.65
catentry       -0.65
ngth           -0.64
atters         -0.64
obyl           -0.64
arya           -0.62
advertising    -0.61
artisan        -0.61
blaster        -0.61
Positive Logits
put            0.87
Wire           0.72
wire           0.70
Doe            0.65
lug            0.65
paraph         0.64
type           0.62
nik            0.61
lude           0.61
McGr           0.60
Activations Density 0.000%