Explanations
negative or questioning statements about the system in use
discussions about perceptions or descriptions of reality and identity
Negative Logits
unparalleled     -0.60
culminating      -0.59
accompan         -0.57
assorted         -0.57
éĩ               -0.55
rather           -0.55
progressively    -0.54
tantal           -0.52
seemingly        -0.52
almost           -0.52
Positive Logits
anymore          1.65
nor              1.45
yet              1.24
nor              1.11
yet              0.97
:(               0.97
slightest        0.87
necessarily      0.87
unless           0.83
anything         0.82
Activations Density 0.521%