INDEX
Explanations
patterns or markers in numerical data or structured information
New Auto-Interp
Negative Logits
in
-0.64
zu
-0.64
im
-0.62
to
-0.60
pres
-0.59
mot
-0.59
kan
-0.59
${-0.57
k
-0.56
sem
-0.56
POSITIVE LOGITS
myſelf
1.46
itſelf
1.42
ſtate
1.32
ſmall
1.28
Anſ
1.27
pleaſure
1.26
greateſt
1.25
doubtnut
1.25
Efq
1.23
Houſe
1.22
Activations Density 1.196%