INDEX
Explanations
names of people and maybe co-authors of scholarly works
references to authors and contributors in scholarly works
New Auto-Interp
Negative Logits
resil
-0.57
puppies
-0.55
disrespect
-0.53
bathrooms
-0.53
surrounded
-0.51
regulated
-0.51
successors
-0.50
partying
-0.50
discriminated
-0.50
suites
-0.50
POSITIVE LOGITS
Shapiro
0.81
Schro
0.80
Barrett
0.78
Thompson
0.77
Doyle
0.77
Levin
0.76
McCl
0.76
McC
0.75
McK
0.75
Nolan
0.75
Activations Density 0.390%