INDEX
Explanations
instances of the word "its" paired with a following adjective or noun
Negative Logits
thereby     -0.66
[];         -0.65
ufact       -0.63
>>\         -0.59
duction     -0.59
conom       -0.59
Recomm      -0.59
Transfer    -0.58
ocument     -0.58
rehens      -0.58
Positive Logits
own             1.31
sights          1.20
doubts          1.00
detractors      0.93
fingerprints    0.91
tentacles       0.83
faults          0.79
feet            0.78
patented        0.77
strongest       0.76
Activations Density 0.073%
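
For orientation, here is a minimal sketch of how an activation density figure like the one above could be computed. It assumes that density means the fraction of token positions where the feature's activation is above zero; the page does not state this definition. The function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 7 of them active.
sample = [0.0] * 9993 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.1, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that assumed definition, a density of 0.073% would correspond to the feature firing on roughly 7 of every 10,000 tokens.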