INDEX
Explanations
proper names of people or organizations
proper nouns, specifically names of people
Negative Logits
Token           Logit
uminati         -0.59
advertisement   -0.58
......          -0.58
âĢº             -0.57
lihood          -0.57
Occupations     -0.56
æĸ¹             -0.56
¶ħ              -0.55
Thanksgiving    -0.55
vironment       -0.54
Positive Logits
Token           Logit
aney            0.65
kson            0.61
Sov             0.60
rod             0.60
yang            0.60
mond            0.59
recalled        0.59
rice            0.59
guessed         0.58
enson           0.58
Activations Density 0.312%