INDEX
Explanations
proper nouns, specifically "Ga" followed by various different suffixes or words
references to the Gaussian distribution
New Auto-Interp
Negative Logits
enance
-0.80
FACE
-0.77
ty
-0.74
taking
-0.73
tenance
-0.72
PORT
-0.71
tic
-0.71
suit
-0.69
gdala
-0.68
sburgh
-0.67
POSITIVE LOGITS
Ga
1.16
ither
1.11
keye
1.05
vernment
1.03
Ga
0.97
pless
0.93
uth
0.88
illard
0.86
etz
0.84
ussian
0.83
Activations Density 0.012%