INDEX
Explanations
references to fairy tales and related themes
Negative Logits
aeda               -0.87
atility            -0.82
ebus               -0.81
ibaba              -0.80
upon               -0.76
iating             -0.76
inventoryQuantity  -0.75
isitions           -0.74
iferation          -0.73
artney             -0.73
Positive Logits
tale      1.37
tale      1.25
tales     1.04
Tale      0.93
fairy     0.91
princess  0.87
tailed    0.80
tail      0.76
lla       0.74
Tales     0.73
Activations Density: 0.009%