INDEX
Explanations
words related to paper, including "paperwork."
references to various types of paper products
Negative Logits
Token      Logit
ostic      -0.79
CVE        -0.77
artment    -0.73
alez       -0.72
akening    -0.70
aren       -0.70
ivals      -0.69
Xi         -0.68
^^^^       -0.67
ostics     -0.67
Positive Logits
Token      Logit
towels     1.09
clip       1.09
Paper      1.07
towel      0.97
clips      0.89
Paper      0.86
paper      0.81
maid       0.81
flies      0.80
pee        0.79
Activations Density 0.022%