INDEX
Explanations
references to the concept of 'good'
references to food
New Auto-Interp
Negative Logits
UGE
-0.77
Constantin
-0.74
BuyableInstoreAndOnline
-0.73
Ago
-0.71
MIA
-0.69
USE
-0.68
ASED
-0.66
chal
-0.66
UNCH
-0.66
asper
-0.66
POSITIVE LOGITS
ood
1.21
edly
0.93
ed
0.92
iership
0.91
lers
0.90
rill
0.89
les
0.88
ividual
0.87
ler
0.87
skin
0.86
Activations Density 0.016%