Explanations
the name "Cousins" at varying activation levels
mentions of specific individuals and organizations, particularly sports figures and statistical references
Negative Logits
oku         -0.75
ock         -0.72
dog         -0.71
Beck        -0.70
ument       -0.67
ocking      -0.64
Friedman    -0.63
req         -0.63
ishment     -0.62
nings       -0.61
Positive Logits
annabin     0.72
ensis       0.71
Cousins     0.70
grave       0.70
ursive      0.69
entric      0.69
whiff       0.68
itate       0.68
rouch       0.67
illian      0.67
Activations Density: 0.047%