INDEX
Explanations
U.S. state names and universities
New Auto-Interp
Negative Logits
ly
-1.01
iture
-0.93
itional
-0.81
ãĥ¼ãĥĨãĤ£
-0.81
atu
-0.80
sonian
-0.80
HCR
-0.79
*/(
-0.79
elong
-0.78
ãĥ¼ãĥĨ
-0.77
POSITIVE LOGITS
Candle
0.83
APH
0.74
Prophe
0.70
Apostles
0.69
Buc
0.68
650
0.67
GAN
0.62
Lens
0.62
Hook
0.60
apostles
0.60
Activations Density 1.639%