INDEX
Explanations
references to individual names and titles
New Auto-Interp
Negative Logits
Karel
-0.18
Voor
-0.17
erman
-0.16
Powers
-0.16
Gew
-0.15
Morse
-0.15
çijŀ
-0.15
utch
-0.15
ppy
-0.14
azor
-0.14
POSITIVE LOGITS
iÃŁ
0.20
Sting
0.19
-Christian
0.19
Broker
0.19
Diet
0.18
Dipl
0.17
Becker
0.17
asers
0.16
Loch
0.16
Bless
0.16
Activations Density 0.141%