INDEX
Explanations
references to mythology and folklore
New Auto-Interp
Negative Logits
erate
-0.83
neau
-0.79
emouth
-0.78
illon
-0.64
cker
-0.64
Housing
-0.63
ney
-0.63
iment
-0.62
achment
-0.61
nington
-0.61
POSITIVE LOGITS
lore
1.00
tales
0.98
mythology
0.94
tale
0.83
folklore
0.79
myths
0.78
traditions
0.76
lore
0.75
buffs
0.75
arium
0.75
Activations Density 0.012%