INDEX
Explanations
proper nouns, specifically names and places
New Auto-Interp
Negative Logits
abin
-0.76
Laredo
-0.76
Gehir
-0.72
oil
-0.71
vägen
-0.71
WriteLiteral
-0.68
Albans
-0.68
rtx
-0.65
Preußen
-0.64
fasi
-0.63
POSITIVE LOGITS
Schot
0.83
Franck
0.83
)");
0.82
#+#
0.82
gie
0.81
@@@@@@@@
0.81
oporosis
0.80
Notting
0.79
}{@0.78
serviceWorker
0.77
Activations Density 1.509%