INDEX
Explanations
mentions of legends and mythical figures
references to legends or legendary figures/events
New Auto-Interp
Negative Logits
tan
-0.71
Parenthood
-0.68
CLA
-0.61
overflow
-0.59
Adobe
-0.57
DP
-0.57
anus
-0.57
agitation
-0.56
policy
-0.56
Consent
-0.56
POSITIVE LOGITS
arily
1.48
aries
1.12
arium
1.05
naire
1.02
ical
0.84
icist
0.83
ends
0.81
arious
0.80
naires
0.79
ic
0.79
Activations Density 0.053%