INDEX
Explanations
words with the suffix "-en"
a variety of proper nouns and technical terms
Negative Logits
Carib     -0.90
Boyle     -0.80
Bog       -0.77
tyr       -0.77
Bohem     -0.75
Rabbit    -0.75
Bolivia   -0.75
MGM       -0.74
rag       -0.74
Cycl      -0.74
Positive Logits
en        1.79
EN        1.61
ename     1.42
enf       1.41
ens       1.39
ena       1.35
enh       1.33
eni       1.31
eng       1.28
eny       1.28
Activations Density 0.141%