INDEX
Explanations
dialogue or quotations attributed to individuals
New Auto-Interp
Negative Logits
Observer
-0.15
_CNTL
-0.14
observer
-0.14
Observers
-0.14
idine
-0.13
observe
-0.13
Ñ
-0.13
>NN
-0.13
çŀ
-0.13
ãĥ³ãĥĶ
-0.13
POSITIVE LOGITS
shares
0.18
explan
0.18
shared
0.17
Said
0.17
explain
0.16
ilst
0.16
315
0.16
descr
0.16
describe
0.15
ansa
0.15
Activations Density 0.070%