Explanations
proper nouns related to companies or people
Negative Logits
awa           -0.76
Independence  -0.74
indist        -0.72
FORM          -0.72
Connie        -0.71
recre         -0.70
Edmund        -0.69
accompan      -0.69
Mystic        -0.68
CON           -0.68
Positive Logits
z             1.72
zos           1.43
zan           1.42
zon           1.40
Z             1.38
zes           1.37
zie           1.35
zu            1.34
zo            1.33
zing          1.32
Activations Density: 0.070%