INDEX
Explanations
proper nouns, particularly geographical names and institutions
New Auto-Interp
Negative Logits
a
-0.64
کنون
-0.62
льевич
-0.59
-0.56
suivante
-0.55
ViewFeatures
-0.55
seguente
-0.54
pubblici
-0.53
@[+][
-0.52
Omaha
-0.52
POSITIVE LOGITS
Efq
0.86
0.82
Савезне
0.80
Beſ
0.78
########.
0.76
ſever
0.73
Quelles
0.72
Anſ
0.72
tvguidetime
0.71
downg
0.71
Activations Density 0.605%