INDEX
Explanations
mentions of the name "Ron" or variations of it
New Auto-Interp
Negative Logits
ی
-0.49
ylde
-0.46
buti
-0.44
yi
-0.43
arture
-0.43
<
-0.42
PDI
-0.40
APE
-0.39
."]
-0.39
YPE
-0.39
POSITIVE LOGITS
Ron
1.04
Ron
0.92
Ronald
0.91
RON
0.87
Ronald
0.82
ron
0.75
Ronnie
0.75
Ronan
0.72
ron
0.72
Monfieur
0.69
Activations Density 0.008%