INDEX
Explanations
references to various forms and concepts of art
New Auto-Interp
Negative Logits
erland
-0.25
er
-0.23
heart
-0.19
erator
-0.17
erer
-0.17
erd
-0.16
estone
-0.16
art
-0.16
aries
-0.16
gers
-0.15
POSITIVE LOGITS
ifice
0.32
istry
0.31
ifacts
0.25
ificial
0.25
illery
0.23
fully
0.23
esian
0.21
ful
0.19
nouveau
0.18
icular
0.18
Activations Density 0.049%