Explanations
individuals by their names
instances of the word "who" referring to individuals in various contexts
Negative Logits
ulp           -0.68
Nec           -0.62
utation       -0.62
DDR           -0.60
Affordable    -0.60
Bound         -0.59
AGE           -0.59
ķ             -0.58
Georg         -0.58
reach         -0.58
Positive Logits
oversaw       0.90
preceded      0.89
anwhile       0.88
resided       0.88
accompanies   0.88
owns          0.87
soever        0.87
oversees      0.85
famously      0.85
incidentally  0.84
Activations Density: 0.093%