Explanations
phrases related to things being published or distributed
the word "out" in various contexts
Negative Logits
cious         -0.72
antry         -0.72
cius          -0.71
iosity        -0.63
examiner      -0.60
inski         -0.60
ettlement     -0.59
antine        -0.59
deprivation   -0.56
etched        -0.56
Positive Logits
lier           0.93
fitted         0.91
stretched      0.88
casts          0.85
posts          0.83
wards          0.83
doors          0.83
lander         0.80
)=(            0.80
smart          0.80
Activations Density: 0.110%