INDEX
Explanations
words related to sports figures or athletes
New Auto-Interp
Negative Logits
infringing
-0.73
underdog
-0.67
disparate
-0.66
flooding
-0.66
loopholes
-0.66
presumptive
-0.65
interchangeable
-0.64
scarcity
-0.64
dime
-0.62
stockpile
-0.62
POSITIVE LOGITS
oglu
1.23
icz
1.20
ansky
1.20
tein
1.18
inski
1.17
cki
1.17
owski
1.15
nen
1.13
eri
1.13
ijk
1.11
Activations Density 0.107%