INDEX
Explanations
entities with names or phrases that contain the characters "Ga"
references to the Gaussian distribution
New Auto-Interp
Negative Logits
gdala
-0.78
FACE
-0.74
PORT
-0.73
sburgh
-0.72
ty
-0.71
enance
-0.70
TY
-0.69
LEASE
-0.68
Ö¼
-0.68
ACTED
-0.65
POSITIVE LOGITS
ither
1.15
vernment
1.14
keye
1.06
Ga
1.01
ussian
0.97
pless
0.95
Ga
0.85
uss
0.84
illard
0.81
urd
0.79
Activations Density 0.034%