INDEX
Explanations
the root cause or origin of something
words related to the concept of origin or causation
New Auto-Interp
Negative Logits
Bought
-0.59
booked
-0.59
oppy
-0.56
Preview
-0.54
ockets
-0.54
Rated
-0.54
phis
-0.54
istors
-0.53
maneu
-0.53
Recomm
-0.53
POSITIVE LOGITS
from
1.16
from
1.06
FROM
1.02
From
0.98
solely
0.90
principally
0.89
From
0.88
directly
0.86
derive
0.85
chiefly
0.80
Activations Density 0.079%