INDEX
Explanations
phrases related to paper or situations where paper is involved
references to paper-related materials and products
Negative Logits
Token      Logit
cffffcc    -0.78
ivals      -0.75
akening    -0.72
oise       -0.71
CVE        -0.71
alez       -0.70
cius       -0.68
eston      -0.67
ostic      -0.67
aren       -0.66
Positive Logits
Token      Logit
clip       1.28
towels     1.19
towel      1.01
clips      0.98
backs      0.87
maid       0.86
Paper      0.86
pus        0.84
brush      0.82
pee        0.81
Activations Density: 0.020%