INDEX
Explanations
proper nouns related to people
occurrences of names and titles of people
Negative Logits
Token      Logit
sylv       -0.76
sarc       -0.71
acebook    -0.70
sugg       -0.68
fict       -0.67
SOURCE     -0.67
idon       -0.66
avorite    -0.66
inarily    -0.64
clos       -0.64
Positive Logits
Token      Logit
Daw        0.94
Sabha      0.91
Gandhi     0.83
Sek        0.73
uri        0.72
Khalid     0.71
Yak        0.69
Chow       0.69
Krish      0.68
Sharma     0.67
Activations Density 0.094%
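
To make the density figure concrete, here is a minimal sketch. It assumes "activations density" means the fraction of token positions on which the feature has a nonzero activation; the page itself does not define the metric. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical. Under that assumption, 0.094% corresponds to roughly 94 active positions per 100,000 tokens.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density = (number of active positions) / (total positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 94 of which are active.
acts = [0.0] * 100_000
for i in range(94):
    acts[i * 1000] = 1.5  # arbitrary nonzero activation value

density = activation_density(acts)
print(f"{density:.3%}")  # prints "0.094%"
```

A feature this sparse fires on very few tokens, which fits the narrow explanation given above (names and titles of people).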