INDEX
Explanations
names of people, especially those emphasizing specific letters in their names
Negative Logits
Token        Logit
tremend      -0.91
izoph        -0.70
DRAG         -0.70
»Ĵ           -0.69
ULAR         -0.68
glim         -0.68
ikuman       -0.66
ccording     -0.66
srfAttach    -0.66
eatures      -0.65
Positive Logits
Token        Logit
kamp          1.04
mann          0.95
berg          0.90
feld          0.88
eman          0.86
wald          0.85
hoff          0.84
berger        0.84
croft         0.83
Productions   0.82
Activations Density 0.159%
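
The density figure above is presumably the fraction of sampled token positions on which this feature is active. That reading is an assumption, since the page does not define the term. Under that assumption, a minimal sketch of the arithmetic could look like the following; the function name `activation_density`, the zero threshold, and the sample counts (159 active positions out of 100,000) are all hypothetical and chosen only so the result matches the displayed percentage.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "active" means strictly greater than the threshold (0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample, not taken from the source data:
# 159 active positions out of 100,000 sampled tokens.
sample = [1.0] * 159 + [0.0] * (100_000 - 159)
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.159%
```

A density this low would mean the feature fires on roughly one or two tokens per thousand, which fits a feature tied to a narrow class of tokens such as name fragments.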