INDEX
Explanations
references to measurements of distance
New Auto-Interp
Negative Logits
ulate
-0.84
stal
-0.75
essee
-0.75
umption
-0.73
steen
-0.72
ovy
-0.71
ovie
-0.69
udeb
-0.69
dom
-0.68
icious
-0.67
POSITIVE LOGITS
travelled
1.17
traveled
1.04
distances
1.04
distance
0.96
separating
0.84
finder
0.79
between
0.78
Distance
0.77
flung
0.74
AGE
0.74
Activations Density 0.008%