INDEX
Explanations
references to concepts of wholeness or totality
Negative Logits
uled       -0.16
jen        -0.16
light      -0.16
UBL        -0.15
subtype    -0.15
iest       -0.14
certain    -0.14
ron        -0.14
iji        -0.14
illin      -0.14
Positive Logits
thing      0.28
ties       0.26
spectrum   0.23
gam        0.22
pectrum    0.20
thing      0.20
heart      0.19
Thing      0.19
Thing      0.19
breadth    0.18
Activations Density 0.040%