INDEX
Explanations
proper nouns
mentions of the name "Hart."
New Auto-Interp
Negative Logits
terday
-0.74
ntil
-0.65
ournal
-0.63
avorite
-0.62
shortcut
-0.61
ITAL
-0.61
Yon
-0.61
currency
-0.60
transsexual
-0.60
ĵĺ
-0.59
POSITIVE LOGITS
wig
1.30
mut
1.25
mann
1.17
enstein
1.15
nell
1.11
swick
1.10
igan
1.07
wick
1.06
ley
1.02
lett
1.01
Activations Density 0.026%