INDEX
Explanations
the word "Dist" or its variations
various forms of the word "distribute" or related terms
New Auto-Interp
Negative Logits
glers
-1.17
swick
-0.93
terday
-0.86
ppo
-0.83
tes
-0.79
fter
-0.70
gery
-0.70
ton
-0.70
theless
-0.69
ggle
-0.68
POSITIVE LOGITS
ribut
1.42
ributed
1.41
ribute
1.23
ribution
1.22
illery
1.20
inguished
1.19
ortion
1.14
rict
1.11
inctions
1.06
inct
1.04
Activations Density 0.013%