INDEX
Explanations
mentions of something being "not new."
references to the concept of "newness" or the idea that something is not new
New Auto-Interp
Negative Logits
tein
-0.68
Winc
-0.65
thora
-0.64
rotein
-0.63
unk
-0.62
regate
-0.61
fax
-0.60
prus
-0.59
iffs
-0.59
ignt
-0.58
POSITIVE LOGITS
rums
0.77
beginnings
0.72
actionDate
0.71
owan
0.70
lishes
0.69
liest
0.69
infancy
0.68
born
0.67
ORN
0.65
yk
0.64
Activations Density 0.063%