INDEX
Explanations
references to the brand "Nutella"
references to Nut-related products or brands
Negative Logits
Token       Logit
hip         -0.75
xual        -0.72
tou         -0.70
silence     -0.69
>>>>>>>>    -0.67
bree        -0.64
acters      -0.64
[+          -0.63
jack        -0.63
pez         -0.62
Positive Logits
Token       Logit
ritional    1.48
rient       1.18
rients      1.04
nutrit      0.94
agraph      0.88
ron         0.88
tall        0.87
Nut         0.86
ella        0.84
combe       0.82
Activations Density 0.023%
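
For readers unfamiliar with the density figure, the sketch below shows one common way such a number is obtained: the percentage of token positions on which the feature's activation is above zero. This is an assumption about how the statistic is defined, not something the page itself states; the function name activation_density, the threshold parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of positions whose activation exceeds threshold.

    Assumption: "density" means the share of token positions on which the
    feature fires at all. The dashboard may define it differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 10,000 token positions, the feature fires on 2 of them.
acts = [0.0] * 10000
acts[17] = 3.2
acts[4242] = 1.1

density = activation_density(acts)
print(f"{density:.3f}%")  # prints "0.020%"
```

Under this reading, a density of 0.023% would mean the feature fires on roughly 23 of every 100,000 tokens.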