Explanations
names of individuals
proper nouns, specifically names and organizations
Negative Logits
ividual         -0.74
explanatory     -0.73
Pastebin        -0.73
sidx            -0.72
dracon          -0.70
scatter         -0.69
aisle           -0.69
displayText     -0.69
chronological   -0.68
ioxide          -0.67
Positive Logits
Wyn             0.88
Wong            0.84
Ni              0.80
Nguyen          0.78
imar            0.76
Cunningham      0.75
Norman          0.75
Nielsen         0.74
Thompson        0.73
Nap             0.72
Activations Density 0.373%