INDEX
Explanations
references to a specific person named Josh
the name "Josh" in various contexts
Negative Logits
arching     -0.68
prevail     -0.66
bureaucr    -0.66
ragon       -0.65
porting     -0.64
cers        -0.63
cele        -0.63
confisc     -0.61
pired       -0.60
System      -0.59
Positive Logits
Dug         0.87
aic         0.86
ima         0.81
McC         0.81
Sawyer      0.81
imal        0.80
Whedon      0.77
chens       0.76
arious      0.76
imon        0.74
Activations Density: 0.025%