INDEX
Explanations
proper nouns, specifically the name "Vaughn"
references to specific individuals' names
Negative Logits
inately       -0.71
Yamato        -0.71
tul           -0.68
selectively   -0.68
plat          -0.67
veter         -0.67
arrow         -0.66
Lama          -0.65
temporal      -0.65
opaque        -0.65
Positive Logits
ranch         0.96
ifax          0.95
aunder        0.93
Vaughn        0.85
ashington     0.85
ipeg          0.82
ILLE          0.82
ooters        0.81
ISTORY        0.79
licks         0.78
Activations Density 0.022%