INDEX
Explanations
the word "dig" at the end of words
occurrences of the letters "ig"
New Auto-Interp
Negative Logits
plain
-0.77
ervative
-0.67
Sussex
-0.66
sideline
-0.62
Nurs
-0.61
Somerset
-0.60
WAYS
-0.60
Sov
-0.59
CSI
-0.59
colon
-0.58
POSITIVE LOGITS
rett
1.11
abyte
1.10
wig
1.09
arette
1.08
inning
1.07
ig
1.07
loo
1.04
ga
1.03
ogo
1.02
arettes
1.02
Activations Density 0.015%