Explanations
companies or brands
instances of the word "Win" in various contexts
Negative Logits
gravity       -0.70
populated     -0.65
£ı            -0.65
gat           -0.61
intestine     -0.61
pree          -0.60
reservations  -0.59
warr          -0.59
reluct        -0.59
moon          -0.59
Positive Logits
ners          1.26
throp         1.17
frey          1.13
ning          1.08
ston          1.06
fred          1.02
nings         1.02
emaker        0.99
ters          0.97
kel           0.97
Activations Density: 0.023%