INDEX
Explanations
mentions of creative or artistic work
New Auto-Interp
Negative Logits
Ukrain
-0.98
wcs
-0.75
kefeller
-0.71
Args
-0.64
ylon
-0.63
champagne
-0.61
idate
-0.61
angular
-0.61
rition
-0.60
Gord
-0.60
POSITIVE LOGITS
ethic
1.42
flows
1.37
station
1.24
aday
1.15
manship
1.10
bench
1.08
horse
0.98
mates
0.90
forces
0.89
papers
0.89
Activations Density 0.031%