INDEX
Explanations
proper nouns or names
distinct names or identifiers related to people or entities
Negative Logits
Giles      -0.81
Ange       -0.78
Hur        -0.73
Gat        -0.72
agra       -0.69
Hayward    -0.69
Gale       -0.69
Hastings   -0.68
Chev       -0.67
HAR        -0.67
Positive Logits
n          1.52
N          1.47
ni         1.46
NN         1.41
Ns         1.38
nn         1.36
nb         1.36
NI         1.35
nr         1.30
NT         1.30
Activations Density: 0.585%