INDEX
Explanations
proper nouns, particularly names of people and places
proper nouns, particularly names associated with individuals and places
New Auto-Interp
Negative Logits
tp
-0.73
eem
-0.72
raged
-0.68
cess
-0.68
eq
-0.67
pered
-0.66
Urug
-0.65
Lt
-0.65
edited
-0.65
ioned
-0.65
POSITIVE LOGITS
Barron
1.03
sonian
0.87
Grimm
0.86
riages
0.80
agy
0.79
Webster
0.76
astics
0.76
agraph
0.74
baskets
0.73
oké
0.72
Activations Density 0.011%