Explanations
instances of the word "Now" indicating a transition or change in focus
Negative Logits
yah           -0.16
argo          -0.14
irst          -0.14
arya          -0.14
umpt          -0.14
erde          -0.13
ah            -0.13
iagnostics    -0.13
quisites      -0.13
mieux         -0.13
Positive Logits
here          0.20
granted       0.19
mind          0.18
onto          0.17
ise           0.16
ーリ          0.16
ww            0.15
adays         0.15
Granted       0.15
withstanding  0.14
Activations Density 0.021%