INDEX
Explanations
proper nouns like "Randy"
mentions of specific individuals, particularly those named Randy
New Auto-Interp
Negative Logits
strings
-0.88
ivity
-0.81
ioch
-0.78
ivism
-0.77
oresc
-0.75
orescence
-0.75
ences
-0.73
orescent
-0.72
ogens
-0.70
unction
-0.68
POSITIVE LOGITS
Cout
0.91
wine
0.87
Olson
0.82
Savage
0.79
vich
0.78
croft
0.78
Castle
0.75
Bauer
0.73
Ples
0.71
Hendricks
0.70
Activations Density 0.022%