INDEX
Explanations
proper nouns related to technology, entertainment, and events happening in a work or fictional setting
Negative Logits
Token       Logit
margin      -0.58
rising      -0.56
"?          -0.56
foundland   -0.55
%:          -0.54
Related     -0.53
unless      -0.51
Canaver     -0.50
rely        -0.50
TOTAL       -0.49
Positive Logits
Token       Logit
intervened  0.95
gave        0.86
blew        0.85
opted       0.84
took        0.83
deems       0.78
couldn      0.78
went        0.78
withdrew    0.78
showed      0.77
Activations Density: 0.612%