INDEX
Explanations
references to geographic locations and academic institutions
New Auto-Interp
Negative Logits
itzer
-0.17
iti
-0.17
asti
-0.16
ãĥ©ãĥĥãĤ¯
-0.16
reau
-0.16
ãĥ«ãĤ¯
-0.15
gang
-0.15
orus
-0.15
ivant
-0.15
egot
-0.15
POSITIVE LOGITS
Ness
0.19
USD
0.18
Pretty
0.18
Haven
0.18
Burl
0.18
Pratt
0.17
Coff
0.17
Solomon
0.17
Ottawa
0.17
Chan
0.17
Activations Density 0.004%