Explanations
names of professions or titles
Negative Logits
Token       Logit
Operation   -0.61
Pixie       -0.61
Hyp         -0.58
downside    -0.58
ãĥį         -0.58
noon        -0.57
Angle       -0.56
DEN         -0.56
Invasion    -0.56
PDATE       -0.56
Positive Logits
Token       Logit
etc         1.24
mith        1.12
hips        1.06
folk        1.02
paces       1.02
etc         0.95
cript       0.94
hare        0.90
hops        0.86
chool       0.86
Activations Density 0.135%