INDEX
Explanations
proper nouns or specific names that things are named after
phrases that indicate naming or honoring someone or something
New Auto-Interp
Negative Logits
ylene
-0.82
atisf
-0.77
eat
-0.77
escalate
-0.76
mediated
-0.72
inhibits
-0.71
xc
-0.70
heric
-0.70
iot
-0.70
igi
-0.68
POSITIVE LOGITS
Sanskrit
0.83
Revival
0.78
Franch
0.76
Herbert
0.75
initials
0.74
Scots
0.73
Him
0.73
Guth
0.73
Lore
0.72
Lincoln
0.72
Activations Density 0.072%