INDEX
Explanations
mentions of fairy tales and related elements such as characters and themes
references to fairy tales and fantastical elements
New Auto-Interp
Negative Logits
ebus
-0.85
ricted
-0.83
ĵĺ
-0.71
İĭ
-0.70
iferation
-0.70
acks
-0.70
utherford
-0.70
atility
-0.70
aepernick
-0.70
oval
-0.69
POSITIVE LOGITS
tale
1.31
fairy
1.09
Fairy
1.07
tale
1.00
Tale
0.94
tales
0.84
princess
0.81
ãĥ«
0.75
ãĥīãĥ©
0.74
glers
0.73
Activations Density 0.009%