INDEX
Explanations
references to art or artistic activities
references to art and artistic expression
New Auto-Interp
Negative Logits
DN
-0.70
htt
-0.68
Crosby
-0.67
Wyoming
-0.66
sshd
-0.65
McGee
-0.61
ership
-0.61
KI
-0.61
pora
-0.60
inki
-0.60
POSITIVE LOGITS
istry
1.57
isans
1.43
emis
1.35
ifice
1.34
esian
1.29
works
1.29
ifacts
1.16
illery
1.14
ificial
1.07
icles
1.00
Activations Density 0.025%