INDEX
Explanations
the word "last" followed by a number
instances of the word "last" in various contexts
New Auto-Interp
Negative Logits
ingen
-0.91
cil
-0.71
nel
-0.67
selves
-0.66
amber
-0.65
ancel
-0.64
hire
-0.64
ourage
-0.62
velt
-0.62
gravity
-0.62
POSITIVE LOGITS
gasp
1.12
vest
1.01
ditch
0.95
straw
0.93
rites
0.90
remnant
0.90
decade
0.88
bast
0.88
installment
0.83
remaining
0.82
Activations Density 0.062%