INDEX
Explanations
words related to truth or facts
references to "reality."
New Auto-Interp
Negative Logits
indal
-0.77
rav
-0.73
cair
-0.73
ucky
-0.73
ergy
-0.72
asus
-0.72
rosse
-0.71
dies
-0.71
oyal
-0.70
asso
-0.70
POSITIVE LOGITS
ignment
0.90
istically
0.89
psons
0.87
reality
0.85
reality
0.85
srfAttach
0.82
distortion
0.76
fulness
0.75
quickShipAvailable
0.73
Reality
0.70
Activations Density 0.027%