INDEX
Explanations
occurrences of the word "for" in various contexts
Negative Logits
aven      -0.20
387       -0.17
makers    -0.17
yan       -0.16
yon       -0.16
yla       -0.15
ammers    -0.15
elian     -0.15
regards   -0.15
s         -0.15
Positive Logits
Sale      0.26
sale      0.24
bidden    0.22
Sale      0.22
iginal    0.22
-sale     0.22
êt        0.22
SALE      0.21
Hire      0.20
feit      0.20
Activations Density: 0.143%