INDEX
Explanations
proper nouns, such as names of individuals, locations, and organizations
Negative Logits
weave       -0.57
gauge       -0.55
icably      -0.55
Gandhi      -0.55
icable      -0.55
oppressed   -0.54
($)         -0.53
KKK         -0.53
1800        -0.53
Ethiop      -0.53
Positive Logits
idon        1.03
EStream     0.84
Soup        0.76
stein       0.76
ornia       0.76
Fly         0.75
usky        0.74
cone        0.74
antine      0.72
glers       0.71
Activations Density: 5.946%